Proposed: Book Scanning Races!

Post by StevePoling » 03 Oct 2009, 13:34

When enough of us have scanners up and running, we should figure out how to do book scanning races: a timed event where you take a book, scan it, OCR it, then clean up the result.

Start with a book list. (Ask someone from Project Gutenberg.) Everyone gets a stopwatch and a set of books to scan, drawn from this list. Each contestant scans the assigned books and submits a time for each one. Each book shall be scanned a total of 3 times, by 3 different contestants. Judges compare the dupes against each other to find errors and assess points.
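To make the bookkeeping concrete, here is a rough Python sketch of one way it could work. Everything in it is an assumption for illustration, not a settled rule: the names assign_books, disagreement, and score_dupes are hypothetical, the round-robin deal is just one simple way to get each book to 3 different contestants, and the word-level comparison with the standard library's difflib stands in for whatever the judges would actually do.

```python
import difflib
from itertools import combinations


def assign_books(books, contestants, copies=3):
    """Deal books round-robin so each book goes to `copies` different contestants."""
    if copies > len(contestants):
        raise ValueError("need at least as many contestants as copies per book")
    assignments = {name: [] for name in contestants}
    slot = 0
    for book in books:
        # Consecutive slots always land on distinct contestants
        # because copies <= len(contestants).
        for _ in range(copies):
            assignments[contestants[slot % len(contestants)]].append(book)
            slot += 1
    return assignments


def disagreement(text_a, text_b):
    """Return 1 minus the word-level similarity ratio of two transcriptions."""
    matcher = difflib.SequenceMatcher(
        None, text_a.split(), text_b.split(), autojunk=False
    )
    return 1.0 - matcher.ratio()


def score_dupes(transcripts):
    """Map each contestant to their mean disagreement with the other scans.

    `transcripts` maps contestant name -> finished text for ONE book.
    A higher score means that text differs more from the other dupes,
    which suggests (but does not prove) more errors.
    """
    totals = {name: 0.0 for name in transcripts}
    for a, b in combinations(transcripts, 2):
        d = disagreement(transcripts[a], transcripts[b])
        totals[a] += d
        totals[b] += d
    others = max(len(transcripts) - 1, 1)  # avoid dividing by zero for a lone scan
    return {name: total / others for name, total in totals.items()}
```

With 3 dupes per book, the odd one out on any given word is probably the one in error, so the text that disagrees most with the other two is the likeliest to lose points. A judge would still want to eyeball the actual differences, since two contestants can share the same OCR mistake.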

This exercise would provide a bottom-line evaluation of hardware, software, and operator workflow. We're pursuing different hardware and software options. Though we think we know what works better, self-deception tends to get squashed by race results. (Ask me about Pinewood Derby racing someday.)

Winners would get bragging rights and (upon full disclosure of their winning formulas) a nice prize. And the participants (and only participants) would each get copies of the etexts. If we can get a judge to agree this is a fair use, we can include copyrighted works. And we could donate etexts to whatever charities provide books for the blind.

But the big win for everyone would be that we'd learn (and confirm) the relative merits of everything we've been discussing here.

Re: Proposed: Book Scanning Races!

Post by spamsickle » 04 Oct 2009, 16:35

It might provide some interesting data points, but I imagine that operator skill would probably be the determining factor, both in the scanning phase and the post-processing phase. For that reason, I don't expect you'd get any clear insights about which designs or software are superior from such an exercise.

Personally, I prefer cooperation to competition in this hobby. I appreciate everyone's contributions, even those I don't use myself. The heck with bragging rights; I just want a library I can carry on a spool of DVDs.

Which reminds me, after waiting for weeks for a free copy of the Gutenberg 2006 DVD that still hasn't arrived, I finally bit the bullet and downloaded their piecemeal ISO images to burn one of my own. If anyone would like a copy, PM me here.

