First build with automatic dewarping

Tulon · Post by **Tulon** » 03 Jan 2011, 05:30

n9yty wrote:Looks like my changes to fix the crash on the Mac with zones never made it into the source repository

True. I seem to remember I was willing to accept the patch based on timings, rather than the one based on intercepting all kinds of events of QMenu. If you re-submit that patch, I may apply it.
BTW, I've recently filled the appropriate suggestion to fix or extend the QMenu API to the Qt bugtracker. Don't hold your breath though - it took them like a year to acknowledge and fix the previous bug I reported to them.

Tulon · Post by **Tulon** » 03 Jan 2011, 06:02

eL_PuSHeR wrote:I am beginning to think it's some issue in Win7.

No, I found and fixed the problem. It only happened in Color/Grayscale mode with illumination equalization turned off, which is why I couldn't reproduce it. Problems with crash reporting might well be Windows 7 related though.

rob · Post by **rob** » 03 Jan 2011, 10:07

Tulon wrote:Rob, I think RANSAC should work really well for selecting two representative lines. You basically pick two at random, build a distortion model based on them and check how well other lines fit into that model. Repeat the process a number of times and go with the best attempt. What do you think?

I didn't realize it at the time, but I used a RANSAC algorithm to choose lines for dekeystoning in my original program. It seemed to work very well, so it might be a good idea to consider.

I think what I did was generate vertical margins based on the topmost longest and bottommost longest lines, then checked all the lines to make sure that they did not go too far outside or too far inside those margins. (I emphasize too far inside because I assumed a uniform block of text) If a line did, remove it, and try again until a good fit is obtained. "Topmost" and "bottommost" didn't mean the absolute top and bottom line, but any line in the top 1/3 of the page, and any line in the bottom 1/3 of the page. The algorithm is more heuristic than anything else.

One other thing I'm considering -- a preprocessor which uses a checkerboard pattern to correct for keystoning (but not for warping). The idea would be that you could place a checkerboard on top of the platen every so often. After all your images are taken, a preprocessor program would run through the images, and if it detects a checkerboard, detect the corners of the checkerboard, compute a dekeystoning matrix, and correct any following images.

I had originally thought that the dekeystoning could be applied more smoothly by interpolating between checkerboard pages, but then I realized that any time you changed the location, orientation, or zoom of the camera, you would have to insert a checkerboard which would only be valid for the next pages, not the previous pages.

Now, OpenCV has all this code built in. And it's C. You might be able to add the OpenCV library to Scan Tailor and do this within Scan Tailor, but I could probably write a standalone program that could be run before ST, to give ST an easier time with the images.

Anyway, those are some random thoughts.

Tulon · Post by **Tulon** » 03 Jan 2011, 10:32

So, I think I've got vertical bounds detection right this time. It works well both for easy and hard cases, and I managed to avoid any heuristics.
Here is a new build: http://depositfiles.com/files/o52qhslyi
Below are some sample results.
Now I'll try to implement RANSAC for selecting the representative text lines. BTW, I've added more visualizations to the process of vertical boundaries detection, to make it easier to understand what's going on.

: vertical_bounds_1.jpg (96.21 KiB) Viewed 8072 times

: vertical_bounds_2.jpg (106.64 KiB) Viewed 8072 times

: vertical_bounds_3.jpg (57.12 KiB) Viewed 8072 times

Post by **daniel_reetz** » 03 Jan 2011, 10:53

That's great -- can't wait to try it. I've been abusing Scan Tailor with some really sub-optimal input, and still getting mostly good results, but I've had quite a few outputs like this one, where the top line is the obvious problem:

Tulon · Post by **Tulon** » 03 Jan 2011, 11:00

I'd like the source image for testing.

Post by **daniel_reetz** » 03 Jan 2011, 11:17

Sure. "CIMG0869" from this directory: http://www.diybookscanner.org/for_tulon/ - DPI should be 238.

I've included some others so you can see how awful this input is - it's quite literally pointing a camera at a book under mixed color temperature lighting. The book is in the public domain, so it's fine to share far and wide.

I am planning on doing some more sane testing under better controlled conditions when I get home from work tonight.

rob · Post by **rob** » 03 Jan 2011, 12:39

That's got to be one of your water-logged books

matt · Post by **matt** » 03 Jan 2011, 13:31

Awesome work, Tulon -- the auto-dewarping is pretty magical!

One thing I've noticed with the betas is that I have been seeing a number of cases like above (using the latest version from Git).

Thanks again for all of your work!

matt · Post by **matt** » 03 Jan 2011, 13:47

Just noticed another example of distortion using beta 5.

DIY Book Scanner

First build with automatic dewarping

Re: First build with automatic dewarping

Re: First build with automatic dewarping

Re: First build with automatic dewarping

Re: First build with automatic dewarping

Re: First build with automatic dewarping

Re: First build with automatic dewarping

Re: First build with automatic dewarping

Re: First build with automatic dewarping

Re: First build with automatic dewarping

Re: First build with automatic dewarping