I've noticed that Scan Tailor sometimes misses page numbers during the content selection phase, usually when a bit of whitespace separates them from the main page content. I'm not sure whether this is a bug or by design, but I thought I'd mention it so it might be improved. This image shows an example of a page where the page number was missed. The example is from 0.9.8; I haven't had a chance to upgrade to 0.9.8.1 yet.
Edit: Here's another page from the same manuscript with another problem. This time, Scan Tailor identified the top of the first paragraph as the top of the page, cutting off a date, the page number, and half of a photograph. I have other examples of photos being cut off too.
