How to convert a book to searchable PDF using open source software

zbgns
Posts: 61
Joined: 22 Dec 2016, 06:07
E-book readers owned: Tolino, Kindle
Number of books owned: 600
Country: Poland

Re: How to convert a book to searchable PDF using open source software

Post by zbgns »

Nice to see that. I had been looking for a free implementation of MRC, and apparently here it is. Unfortunately, I was not able to test the program because of some dependency issues. Nevertheless, it looks very promising. Thank you for sharing information about this software with me.
Merlijn
Posts: 3
Joined: 16 Jul 2021, 14:57
Number of books owned: 0
Country: Netherlands

Re: How to convert a book to searchable PDF using open source software

Post by Merlijn »

zbgns wrote: 19 Jul 2021, 12:05 Nice to see that. I had been looking for a free implementation of MRC, and apparently here it is. Unfortunately, I was not able to test the program because of some dependency issues. Nevertheless, it looks very promising. Thank you for sharing information about this software with me.
Could you share the problem you ran into? You can also open an issue here: https://github.com/internetarchive/archive-pdf-tools

Thanks,
Cheers,
Merlijn
zbgns

Re: How to convert a book to searchable PDF using open source software

Post by zbgns »

Well, actually I cannot say what is wrong. I expect my own lack of experience is the reason.
When I try to install 'archive-hocr-tools', I get this error message:

Code:

$ python3 -m pip install setup.py 
Defaulting to user installation because normal site-packages is not writeable
ERROR: Could not find a version that satisfies the requirement setup.py (from versions: none)
ERROR: No matching distribution found for setup.py

The remaining requirements seem to be satisfied, but I am still not sure. In particular, I have Cython 0.29.24, whereas the required version is apparently 0.29.23.
Could you please provide step-by-step installation instructions for less advanced users? It would be really appreciated.
Many thanks for your interest and help.
Merlijn

Re: How to convert a book to searchable PDF using open source software

Post by Merlijn »

The package can now be installed directly with pip: "pip install archive-pdf-tools"
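
For anyone who hits the same error as above: "pip install setup.py" makes pip look for a package literally named "setup.py" on the package index, which is why no matching distribution was found. To install from a source checkout, pip has to be pointed at the directory instead (for example "."). The sketch below is only an illustration, not part of archive-pdf-tools. The helper names installed_version and pip_install_command are made up for this example, and the distribution names are the ones mentioned in this thread, assumed to match what pip records locally. It reports which of those packages the current Python can see and builds the pip command for either route.

```python
import sys
from importlib import metadata  # standard library, Python 3.8+


def installed_version(dist_name):
    """Return the installed version of a distribution, or None if it is absent."""
    try:
        return metadata.version(dist_name)
    except metadata.PackageNotFoundError:
        return None


def pip_install_command(target):
    """Build a pip command that installs `target` for the running interpreter.

    `target` is either a package name from the index (e.g. "archive-pdf-tools")
    or a path to a source checkout (e.g. "."), never the file "setup.py".
    """
    return [sys.executable, "-m", "pip", "install", target]


# Names taken from this thread; adjust them if your setup differs.
for name in ("archive-pdf-tools", "archive-hocr-tools", "Cython"):
    version = installed_version(name)
    print(f"{name}: {version if version else 'not installed'}")

# Show the commands without running them.
print(" ".join(pip_install_command("archive-pdf-tools")))  # install from the index
print(" ".join(pip_install_command(".")))                  # install from a checkout
```

To actually run one of the commands, the list can be passed to subprocess.run(..., check=True). Using sys.executable keeps pip tied to the same Python that will later import the package, which avoids installing into a different interpreter by accident.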