Internationalize a Python application

(Generate template for the translators)
(Configure gettext Define the '_()' function)
 
(38 intermediate revisions not shown)
Line 1: Line 1:
-
[[Category:Internationalization]]
+
This page explains step by step how to add support for translations in a Python application for Maemo.
-
[[Category:Developers]]
+
-
[[Category:N900]]
+
-
= Internationalization of a python application (in Maemo) =
+
There are already some [http://www.learningpython.com/2006/12/03/translating-your-pythonpygtk-application/ tutorials online] about how to internationalize (i18n) Python applications, but I find them difficult to follow and they lack some needed code and final tricks to make everything work fine.  
-
 
+
-
This page explains step by step add support for translations in a python application for maemo.
+
-
 
+
-
There are already some tutorials online about how to internationalize (i18n) python applications, but i find them confusing and they lack some needed code and final tricks to make everything work fine. This is the steps i followed to add i18n in my project Mussorgsky.
+
To support i18n we need to accomplish 5 tasks:
To support i18n we need to accomplish 5 tasks:
-
# Define correctly a function '_()' that translates strings
+
# Define correctly a function '<code>_()</code>' that translates strings
# Mark the strings in the code with that function
# Mark the strings in the code with that function
# Generate a template for the translators
# Generate a template for the translators
# Add translations
# Add translations
-
# Include the translations in your installation
+
# Include the translations in the installation
== The overall process ==
== The overall process ==
-
From the source code, using a command line tool, we will generate a ".pot" file. It is plain text file containing all strings that need translation in the project. This .pot file only needs to be generated/recreated when the strings change in the code (and _not_ in every build!). There is only one .pot file per project.
+
From the source code, using a command line tool, we will generate a "<code>.pot</code>" file. It is plain text file containing all strings that need translation in the project. This <code>.pot</code> file only needs to be generated/recreated when the strings change in the code (and ''not'' in every build!). There is only one <code>.pot</code> file per project.
-
Then, from the .pot files and using another command line tool, we will generate the .po files for each language. There is one .po file per language, and it is the file that the translators need to complete. Both .pot and all .po files should be committed in the repository.
+
Then, from the <code>.pot</code> files and using another command line tool, we will generate the <code>.po</code> files for each language. There is one <code>.po</code> file per language, and it is the file that the translators need to complete. Both <code>.pot</code> and all <code>.po</code> files should be committed in the repository.
-
But once your program is running, it doesn't use directly .po files, but a binary (compiled) version of them: the .mo files. These .mo files must be re-created on build time (when creating the package) and installed in the right location in the system. These are generated files, so don't commit them in the repository.
+
But once your program is running, it doesn't use <code>.po</code> files directly, but a binary (compiled) version of them: the <code>.mo</code> files. These <code>.mo</code> files must be re-created on build time (when creating the package) and installed in the right location in the system. These are generated files, so don't commit them to the repository.
From the code point of view, you just need to call few functions to tell gettext where are the translation files, what language do you want to use (usually the locale of the environment), and then it will use its implementation to find the right string.
From the code point of view, you just need to call few functions to tell gettext where are the translation files, what language do you want to use (usually the locale of the environment), and then it will use its implementation to find the right string.
Line 28: Line 22:
Ok, this is what we need to do. Let's code.
Ok, this is what we need to do. Let's code.
-
== Configure gettext Define the '_()' function ==
 
-
TODO
 
== Mark strings for i18n ==
== Mark strings for i18n ==
-
This is one of the easiest parts. Browse the source code files that show something on the string, and wrap the visible strings (window titles, dialog titles, notifications, labels, button labels and so on) with the '_()' function.
+
This is one of the easiest parts. Browse the source code files that show something on the screen, and wrap the visible strings (window titles, dialog titles, notifications, labels, button labels and so on) with the '<code>_()</code>' function.
For example:
For example:
 +
<source lang="python">
 +
# import the _() function!
 +
import i18n
 +
_ = i18n.language.gettext
-
# import the _() function!
+
class Example (hildon.StackableWindow):
-
import i18n
+
-
_ = i18n.language.gettext
+
-
+
-
class Example (hildon.StackableWindow):
+
-
+
-
    def __init__ (self):
+
-
        hildon.StackableWindow.__init__ (self)
+
-
        self.set_title (_("All music"))
+
-
== Generate template for the translators ==
+
  def __init__ (self):
 +
      hildon.StackableWindow.__init__ (self)
 +
      self.set_title ( _("All music") )  <------- ADD THIS!
 +
</source>
 +
== How to add translations ==
-
Create a po folder in your project (usually at the same level as src/) and run the following command from the top directory of your project. You can do this inside or outside scratchbox, there is no difference.
+
Now you want to get a translation in an specific language. You need to generate a <code>.po</code> for it.
-
xgettext --language=Python --keyword=_ --output=po/PROJECTNAME.pot `find . -name "*.py"`
+
There are several ways to do it. I suggest to do it the easy way with poEdit:
 +
=== With poEdit ===
 +
Poedit is cross-platform gettext catalogs (.po files) editor. For more information, see its website http://www.poedit.net/.
 +
Just start it and generate a new catalog from the generated .pot file.
 +
In the option window you can select the translated language, your name, email address and so on. This is useful, so others know who created this translation.  
-
It will parse files, written in Python, looking for strings marked with the keyword (function name) '_' and saving the output in a file called 'po/PROJECTNAME.pot'. The list of files is the last argument. I like to use "find", but you can manually put a list of files there.
+
Now you can start to translate all the text. If you modify your application and generate a new .pot file, you can easily import it. It will use all the existing translation and add the new/modified strings. If you changed the charset before like me, you will have to select it again.
 +
Every time you save the file, poEdit will generate a .mo file. You can use it for testing the new translation.
-
That command will generate the .pot file. Easy, isn't it?
+
=== Manual way ===
 +
Go to the <code>po/</code> folder and run this command (again, inside/outside scratchbox doesn't matter):
-
== Include translations in your installation ==
+
msginit --input=PROJECTNAME.pot --locale=LOCALE
-
TODO
+
 
-
== How to add translations ==
+
For instance, if I want a <code>.po</code> file for the Spanish translation of my project Mussorgsky, I write:
-
TODO
+
 
 +
msginit --input=mussorgsky.pot --locale=es_ES
 +
 
 +
It will generate a file "<code>es.po</code>" and translators can/must work on that file.
 +
 
 +
'''IMPORTANT''': by default the generated <code>.po</code> file declares its charset as ASCII. This means that translations with non-ascii characters will provoke an error in the program. To solve this, set the charset to '<code>utf-8</code>':
 +
 
 +
"Language-Team: Spanish\n"
 +
"MIME-Version: 1.0\n"
 +
"Content-Type: text/plain; charset=utf-8\n"  <--- THIS MUST BE UTF-8
 +
 
 +
If you don't know the locale of a language, can check [http://www.roseindia.net/tutorials/I18N/locales-list.shtml www.roseindia.net/tutorials/I18N/locales-list.shtml] or [http://people.w3.org/rishida/utils/subtags/ people.w3.org/rishida/utils/subtags].

Latest revision as of 09:43, 4 May 2016

This page explains step by step how to add support for translations in a Python application for Maemo.

There are already some tutorials online about how to internationalize (i18n) Python applications, but I find them difficult to follow and they lack some needed code and final tricks to make everything work fine.

To support i18n we need to accomplish 5 tasks:

  1. Define correctly a function '_()' that translates strings
  2. Mark the strings in the code with that function
  3. Generate a template for the translators
  4. Add translations
  5. Include the translations in the installation

Contents

[edit] The overall process

From the source code, using a command line tool, we will generate a ".pot" file. It is plain text file containing all strings that need translation in the project. This .pot file only needs to be generated/recreated when the strings change in the code (and not in every build!). There is only one .pot file per project.

Then, from the .pot files and using another command line tool, we will generate the .po files for each language. There is one .po file per language, and it is the file that the translators need to complete. Both .pot and all .po files should be committed in the repository.

But once your program is running, it doesn't use .po files directly, but a binary (compiled) version of them: the .mo files. These .mo files must be re-created on build time (when creating the package) and installed in the right location in the system. These are generated files, so don't commit them to the repository.

From the code point of view, you just need to call few functions to tell gettext where are the translation files, what language do you want to use (usually the locale of the environment), and then it will use its implementation to find the right string.

Ok, this is what we need to do. Let's code.

[edit] Mark strings for i18n

This is one of the easiest parts. Browse the source code files that show something on the screen, and wrap the visible strings (window titles, dialog titles, notifications, labels, button labels and so on) with the '_()' function.

For example:

# import the _() function!
import i18n
_ = i18n.language.gettext
 
class Example (hildon.StackableWindow):
 
   def __init__ (self):
       hildon.StackableWindow.__init__ (self)
       self.set_title ( _("All music") )   <------- ADD THIS!

[edit] How to add translations

Now you want to get a translation in an specific language. You need to generate a .po for it.

There are several ways to do it. I suggest to do it the easy way with poEdit:

[edit] With poEdit

Poedit is cross-platform gettext catalogs (.po files) editor. For more information, see its website http://www.poedit.net/. Just start it and generate a new catalog from the generated .pot file. In the option window you can select the translated language, your name, email address and so on. This is useful, so others know who created this translation.

Now you can start to translate all the text. If you modify your application and generate a new .pot file, you can easily import it. It will use all the existing translation and add the new/modified strings. If you changed the charset before like me, you will have to select it again. Every time you save the file, poEdit will generate a .mo file. You can use it for testing the new translation.

[edit] Manual way

Go to the po/ folder and run this command (again, inside/outside scratchbox doesn't matter):

msginit --input=PROJECTNAME.pot --locale=LOCALE

For instance, if I want a .po file for the Spanish translation of my project Mussorgsky, I write:

msginit --input=mussorgsky.pot --locale=es_ES

It will generate a file "es.po" and translators can/must work on that file.

IMPORTANT: by default the generated .po file declares its charset as ASCII. This means that translations with non-ascii characters will provoke an error in the program. To solve this, set the charset to 'utf-8':

"Language-Team: Spanish\n"
"MIME-Version: 1.0\n"
"Content-Type: text/plain; charset=utf-8\n"  <--- THIS MUST BE UTF-8

If you don't know the locale of a language, can check www.roseindia.net/tutorials/I18N/locales-list.shtml or people.w3.org/rishida/utils/subtags.