Meta:Problems in the LaTeX2HTML2Wiki import process

WARNING: this is a quick jot down of the minor beauty glitches in the import process of certain information to the Mediawiki environment. Not all problems are listed, and this is by no means a concise reference in the matter.


Import process

The information is currently imported to the wiki from LaTeX via HTML phase, as the tools available to convert LaTeX to Mediawiki format are unfinished at best. This process causes few minor glitches in the Wiki markup, and the articles imported need to be edited in any case to match even the rudimentary style conforms of the wiki.

  • bold text
  • italic text
  • headers
  • excess line breaks (also with <br />)
  • graybox division around verbatim text areas
  • excess indexes in the beginning of the article
  • random verbatim code blocks after a header or a bulleted list

How to help

You can use different tools to help with the integration of the imported documents to Maemo wiki in several different ways. There are multiple tools available, for example

There are some specific tasks to do to problems listed above. Most of them can and should be done when importing, but some problems may remain afterwards.

  • bold text the transformation of LaTeX to HTML transforms the environment textbf to <span class="textbf"></span>. Because this is a way of implementation in the LaTeX2HTML, it isn't transferrable to wiki and the format '''Bold text'''. This should be fixed.
  • italic text the transformation of LaTeX to HTML transforms the environment textit to <span class="textit"></span>. Because this is a way of implementation in the LaTeX2HTML, it isn't transferrable to wiki and the format ''Italic text''. This should be fixed.
  • headers The top level header is always included with the process and it should be removed. Also, chapter numbers are imported, and they should be removed. An example of this is =<span class="arabic">2</span>.<span class="arabic">1</span> Getting started=

which in reality should be

=Getting started=

  • excess line breaks
  • graybox division around verbatim text areas
  • excess indexes in the beginning of the article
  • random verbatim code blocks after a header or a bulleted list