I have books with plain text and others with lots of photos, graphics and charts, and want to end up with a high quality (archival or close), reasonably small-sized, fully searchable file to be read on an e-reader.
I don't actually have a scanner yet, but I just want to get a better idea of what's involved, what I'm getting myself into. From what I've gleaned so far the process goes something like:
1). Scan book pages using a scanner or camera. This is to get the images onto a computer.
2).The book page images are in image files, and need to be converted to text files
3) Text files can only (?) be created from image files through an OCR process
4). These OCR conversions need to then be proofed, and formated on a page(?) and compressed (?)
5) Then the proofed and compressed files can be converted to PDF (?)
6). The PDFs can be converted to an e-reader format like e-pub or .azw, or just left as PDFs.
Anyway, as you can see, except for the beginning and end, I'm not 100% sure about the order or what exactly is involved in each step.
Thanks in advance.
