Thursday, November 6, 2008

Russian text from my script to blogger via email

It looks like I still need to make Russian text display correctly in Blogger when it is posted from my script. I will just delete and re-upload the pages sometime in the future. After it is all done you will see a translation of each page by a live person. The reason I haven't uploaded the whole book is that Blogger sends a captcha and my app can't handle it. I will have to display the image and send another request/post. We will see as time goes by. I have other things that are higher priority that I need to take care of first.

1. Proofreading the OCR'd text (working on an application to make proofreading a snap, CI)
2. Abstract Translation Script
3. Add a pyGTK/wxPython user interface (select a range of files and send them to translators, select a directory to write files to). I know this is easy, but I don't know pyGTK that well yet. Still learning; I have some books to read.
4. Create a paperback of the translated book.
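On the Russian text problem mentioned at the top: I have not confirmed the cause, but one common reason Cyrillic comes out garbled in a post sent by email is that the message does not declare its character set. Here is a minimal sketch of building such an email with an explicit UTF-8 charset, using the standard `email` package from current Python 3. The function name and both addresses are made up for illustration; they are not the ones my script uses.

```python
from email import message_from_bytes, policy
from email.message import EmailMessage


def build_post_email(sender, blogger_address, title, body):
    """Build a plain-text email whose body is explicitly declared as UTF-8.

    `blogger_address` stands in for a Mail-to-Blogger address; the value
    used below is a placeholder, not a real one.
    """
    msg = EmailMessage()
    msg["From"] = sender
    msg["To"] = blogger_address
    # Non-ASCII subjects are encoded per RFC 2047 when the message is serialized.
    msg["Subject"] = title
    # Declaring the charset tells the receiving side how to decode the bytes.
    msg.set_content(body, charset="utf-8")
    return msg


if __name__ == "__main__":
    msg = build_post_email(
        "me@example.com",               # hypothetical sender
        "secret.words@example.com",     # hypothetical posting address
        "Страница 1",
        "Привет, мир! Это проверка русского текста.",
    )
    raw = msg.as_bytes()                # what would go over the wire
    parsed = message_from_bytes(raw, policy=policy.default)
    print(parsed.get_content_charset())
    print(parsed.get_content())
```

Actually sending it would be one more step with `smtplib.SMTP(...).send_message(msg)`. That part is left out here because it depends on the mail server, and it does nothing about the captcha.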

People who have spent time on this project so far:
Ed Goretoy (my cousin)
Alex Goretoy (my cousin)
Aleksandr Goretoy (me)
Some other people (still need to find out who they were)

You could say it takes about 24 hours to scan and OCR a book with a friend (I spread the time over several days, of course).

A few days for the programs to translate the OCR'd text (that really depends on some other things).

Who knows how much time to proofread (I need to create a web app to proofread the OCR'd text; working on that now).

Then the text has to be translated so that it is understandable and the original message gets across to the reader (this may take a while). If you want to proofread some pages from the book, please let me know. I want each page proofread at least 3 times with my application.

This project is also taking place on http://goretoy.info. Check it out!

You can donate to us for our time on this public project. Slowly but surely you will be able to read the translated text here online or purchase a paperback.
