Meta:Problems in the LaTeX2HTML2Wiki import process

(How to help)
(slightly wikify, categorize)
 
(15 intermediate revisions not shown)
Line 1: Line 1:
-
WARNING: this is a quick jot down of the minor beauty glitches in the import process of certain information to the Mediawiki environment. Not all problems are listed, and this is by no means a concise reference in the matter.
+
WARNING, part 1: most of the issues ''listed'' on this page aren't experienced with the latest versions of the import toolchain, and as a result, this page should be considered as largely outdated. ''But''…
 +
 
 +
WARNING, part 2: this is a quick jot down of the minor beauty glitches in the import process of certain information to the Mediawiki environment. ''Not all problems are listed, and this is by no means a concise reference in the matter.''
Line 5: Line 7:
The information is currently imported to the wiki from LaTeX via HTML phase, as the tools available to convert LaTeX to Mediawiki format are unfinished at best. This process causes few minor glitches in the Wiki markup, and the articles imported need to be edited in any case to match even the rudimentary style conforms of the wiki.
The information is currently imported to the wiki from LaTeX via HTML phase, as the tools available to convert LaTeX to Mediawiki format are unfinished at best. This process causes few minor glitches in the Wiki markup, and the articles imported need to be edited in any case to match even the rudimentary style conforms of the wiki.
-
* bold text
 
-
* italic text
 
* headers
* headers
* excess line breaks (also with <nowiki><br /></nowiki>)
* excess line breaks (also with <nowiki><br /></nowiki>)
* graybox division around verbatim text areas
* graybox division around verbatim text areas
-
* excess indexes in the beginning of the article
 
* random verbatim code blocks after a header or a bulleted list
* random verbatim code blocks after a header or a bulleted list
== How to help ==
== How to help ==
You can use different tools to help with the integration of the imported documents to Maemo wiki in several different ways. There are multiple tools available, for example
You can use different tools to help with the integration of the imported documents to Maemo wiki in several different ways. There are multiple tools available, for example
-
* [http://en.wikipedia.org/wiki/Wikipedia:AutoWikiBrowser AutoWikiBrowser]
+
* [[:wikipedia:Wikipedia:AutoWikiBrowser|AutoWikiBrowser]]
* [http://meta.wikimedia.org/wiki/Using_the_python_wikipediabot Bots]
* [http://meta.wikimedia.org/wiki/Using_the_python_wikipediabot Bots]
 +
 +
There are some specific tasks to do to problems listed above. Most of them can and should be done when importing, but some problems may remain afterwards.
 +
 +
*'''headers''' The top level header is always included with the process and it should be removed. Also, chapter numbers are imported, and they should be removed. An example of this is <br /><nowiki>==<span class="arabic">2</span>.<span class="arabic">1</span> Getting started==</nowiki><br /> which in reality should be <br /><nowiki>==Getting started==</nowiki>
 +
*'''excess line breaks''' There ''might'' be excess line breaks before and after code blocks.
 +
*'''graybox division around verbatim text areas''' The graybox <nowiki><div class="graybox"></nowiki> is not needed in the Wiki and could be removed.
 +
*'''random verbatim code blocks after a header or a bulleted list''' For some reason, the HTML2Wiki adds a whitespace to paragraphs starting immediately after a header or a bulleted list. These should be removed.
 +
 +
[[Category:Wiki]]

Latest revision as of 09:58, 10 May 2010

WARNING, part 1: most of the issues listed on this page aren't experienced with the latest versions of the import toolchain, and as a result, this page should be considered as largely outdated. But

WARNING, part 2: this is a quick jot down of the minor beauty glitches in the import process of certain information to the Mediawiki environment. Not all problems are listed, and this is by no means a concise reference in the matter.


[edit] Import process

The information is currently imported to the wiki from LaTeX via HTML phase, as the tools available to convert LaTeX to Mediawiki format are unfinished at best. This process causes few minor glitches in the Wiki markup, and the articles imported need to be edited in any case to match even the rudimentary style conforms of the wiki.

  • headers
  • excess line breaks (also with <br />)
  • graybox division around verbatim text areas
  • random verbatim code blocks after a header or a bulleted list

[edit] How to help

You can use different tools to help with the integration of the imported documents to Maemo wiki in several different ways. There are multiple tools available, for example

There are some specific tasks to do to problems listed above. Most of them can and should be done when importing, but some problems may remain afterwards.

  • headers The top level header is always included with the process and it should be removed. Also, chapter numbers are imported, and they should be removed. An example of this is
    ==<span class="arabic">2</span>.<span class="arabic">1</span> Getting started==
    which in reality should be
    ==Getting started==
  • excess line breaks There might be excess line breaks before and after code blocks.
  • graybox division around verbatim text areas The graybox <div class="graybox"> is not needed in the Wiki and could be removed.
  • random verbatim code blocks after a header or a bulleted list For some reason, the HTML2Wiki adds a whitespace to paragraphs starting immediately after a header or a bulleted list. These should be removed.