Thanks to rob's reply, I found a Python script that looks ideal for proofreading OCR text: it works one page at a time and removes scannos that guiguts/guiprep often miss. It is called proofer.py and can be downloaded from [url]http://git.sugarlabs.org/e-book-making-scripts/mainline/trees/master[/url]
Apparently proofer.py works beautifully on Windows, but I can't use it on a Mac because PyGTK is missing.
If anyone knows how I could get this Python script working on a Mac, I would love to know!
Information sourced from the E-Book Enlightenment PDF, available on the web:
The proofer.py utility requires PyGTK. While there is a PyGTK download for Windows, there is none for the Macintosh. PyGTK is included with every Linux distribution.
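Before trying proofer.py on any machine, it may help to check whether the Python you are running can see PyGTK at all. This is only a rough sketch: the helper name has_module is made up for illustration, and it assumes PyGTK exposes itself under the module name gtk. A failed import only tells you PyGTK is not visible to this particular Python; it does not say why.

```python
def has_module(name):
    """Return True if the named module can be imported by this Python."""
    try:
        __import__(name)
    except ImportError:
        return False
    return True


if __name__ == "__main__":
    # "gtk" is assumed to be the module name that PyGTK installs.
    print("PyGTK available: %s" % has_module("gtk"))
```

If this prints False, proofer.py will most likely fail at its import lines too.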
To download and install PyGTK for Windows you'll need to follow the instructions here:
http://www.pygtk.org/downloads.html
On Windows a version of GTK+ is included with The GIMP install, but is not adequate for running PyGTK. You'll need to uninstall it, install the new GTK+ bundle, and replace the PATH entry for GTK to point to the new one. If that sounds like a lot more work than you normally go through to install a Windows program, it is. You may find running proofer.py on Windows more trouble than it's worth. The other Python programs should still be useful on Windows.
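To see which GTK folder your PATH currently points at (for example, whether the old GIMP copy is still listed), something like the following might help. It is a minimal sketch, not part of the e-book scripts: the function name gtk_path_entries is made up, and it simply assumes that any PATH entry containing "gtk" in its name is a GTK install.

```python
import os


def gtk_path_entries(path_value=None, sep=os.pathsep):
    """Return (position, entry) pairs for PATH entries that mention GTK."""
    if path_value is None:
        path_value = os.environ.get("PATH", "")
    # Drop empty entries left by doubled or trailing separators.
    entries = [e for e in path_value.split(sep) if e]
    # Assumption: a GTK install has "gtk" somewhere in its folder name.
    return [(i, e) for i, e in enumerate(entries) if "gtk" in e.lower()]


if __name__ == "__main__":
    for position, entry in gtk_path_entries():
        print("%d: %s" % (position, entry))
```

If more than one entry shows up, the one with the lowest position is searched first, so an old GTK folder listed ahead of the new one could still get picked up.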
The Python programs themselves can be downloaded here:
http://git.sugarlabs.org/e-book-making- ... ees/master
The trick to downloading them is to click on the program name on this page, which will give you a formatted listing of the code. When you get that, look to the upper right of the listing for a link named Raw blob data. Click on that to download the program.