Meta:Problems in the LaTeX2HTML2Wiki import process

Revision as of 22:30, 26 February 2009

WARNING: This is a quick list of the minor cosmetic glitches in the process of importing certain information into the MediaWiki environment. Not all problems are listed, and this is by no means a complete reference on the matter.


Import process

The information is currently imported to the wiki from LaTeX via an intermediate HTML phase, because the tools available for converting LaTeX directly to MediaWiki format are unfinished at best. This process causes a few minor glitches in the wiki markup, and the imported articles need to be edited in any case to match even the rudimentary style conventions of the wiki. The known glitches concern:

  • headers
  • excess line breaks (also with <br />)
  • graybox division around verbatim text areas
  • random verbatim code blocks after a header or a bulleted list

How to help

You can help integrate the imported documents into the Maemo wiki in several ways. There are multiple tools available, for example

Each of the problems listed above has a specific fix. Most of the fixes can and should be applied during the import, but some problems may remain afterwards.

  • headers: The top-level header is always included by the import process and should be removed. Chapter numbers are also imported and should be removed. For example,
    =<span class="arabic">2</span>.<span class="arabic">1</span> Getting started=
    should become
    =Getting started=
  • excess line breaks: There might be excess line breaks before and after code blocks.
  • graybox division around verbatim text areas: The graybox <div class="graybox"> is not needed in the wiki and can be removed.
  • random verbatim code blocks after a header or a bulleted list: HTML2Wiki adds leading whitespace to paragraphs that start immediately after a header or a bulleted list, and MediaWiki renders lines that begin with a space as verbatim text. This leading whitespace should be removed.
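
The fixes listed above could be scripted roughly as follows. This is only a minimal sketch, not the tooling actually used for the import: the function names (clean_header, strip_accidental_indent, clean_wikitext) are hypothetical, and it assumes that graybox divisions are not nested, that a <br /> worth removing sits alone on its line, and that any indented paragraph directly after a header or bullet item is accidental. It does not remove the top-level header, and it cannot tell an intentional verbatim block placed right after a header from an accidental one, so the output still needs a manual review.

```python
import re

# Numbering that LaTeX2HTML leaves at the start of a header title, e.g.
# <span class="arabic">2</span>.<span class="arabic">1</span>
CHAPTER_NUMBER = re.compile(r'^(?:<span class="arabic">\d+</span>\.?)+\s*')
# A wiki header line such as "=Title=" or "==Title=="
HEADER = re.compile(r'^(=+)\s*(.*?)\s*(=+)\s*$')
# Graybox wrapper around verbatim text (assumed not to be nested)
GRAYBOX = re.compile(r'<div class="graybox">(.*?)</div>', re.DOTALL)
# A line that contains nothing but a <br /> tag
BR_ONLY = re.compile(r'\s*<br\s*/?>\s*')


def clean_header(line):
    """Drop imported chapter numbers from a header line."""
    match = HEADER.match(line)
    if not match:
        return line
    title = CHAPTER_NUMBER.sub('', match.group(2))
    return f'{match.group(1)}{title}{match.group(3)}'


def strip_accidental_indent(lines):
    """De-indent paragraphs that directly follow a header or bullet item."""
    result = []
    after_block = False  # last non-blank line was a header or a bullet
    stripping = False    # inside a paragraph that is being de-indented
    for line in lines:
        if not line.strip():
            stripping = False
            result.append(line)
            continue
        if line[0] in ' \t' and (after_block or stripping):
            line = line.lstrip()
            stripping = True
        else:
            stripping = False
        after_block = line.startswith(('=', '*'))
        result.append(line)
    return result


def clean_wikitext(text):
    """Apply the header, line break, graybox and indent fixes in one pass."""
    text = GRAYBOX.sub(r'\1', text)  # keep the content, drop the wrapper
    lines = []
    for line in text.split('\n'):
        # Treat a lone <br /> as a blank line, otherwise fix headers
        lines.append('' if BR_ONLY.fullmatch(line) else clean_header(line))
    lines = strip_accidental_indent(lines)
    # Collapse runs of blank lines into a single blank line
    return re.sub(r'\n{3,}', '\n\n', '\n'.join(lines))
```

Calling clean_wikitext on the raw text of an imported article returns the cleaned text, which can then be pasted back into the wiki editor and checked by hand.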