GSoC 2010/Projects/page-reader

So far, unpaper and tesseract are ported and I have managed to convert images taken by the camera to text and output them via scripts from the command line. My current task is to get the image processing working correctly, so that the spoken text is more accurate.

...gotta go hit the road...will update in half hour...

Contents

[edit] Optical Page Reader for the N900 - GSoC 2010 Project

  • Student: David Jordan (dmj726)
  • Mentor: Florian Boor (florian)
  • Project Status: accepted, in development
  • Repository:
    • todo

Status updates for the project can be found on the blog: todo

[edit] Project summary

I plan to make an application for the Nokia n900 to allow blind and visually impaired people to read books, newspapers, magazines, signs, and other printed text on their own. The user simply opens the app and takes a picture of the document. The app then processes the image with an OCRFeeder-based backend and reads it aloud via text-to-speech software.

[edit] Progress

So far, unpaper and tesseract are ported and I have managed to convert images taken by the camera to text and output them via scripts from the command line. My current task is to get the image processing working correctly, so that the spoken text is more accurate. By the end of today, I should have this automated.

  • porting except for ocrfeeder and two image processing steps - done
  • ocrfeeder needs pygoocanvas, which compiles but needs packaging
  • camera app is mostly there but needs a few tweaks.
  • GUI prototype is done, but needs some experimentation with controls.

Filed application for garage project, but haven't gotten approval yet.

[edit] To-do

  • Finish porting and packaging OCRFeeder.
  • Automate image processing.
  • Tie this to the GUI.