Thanks all for the support- I feel pretty lucky that this actually gets to be my job (at least, my current assignment). On that note, some job complications have slowed progress momentarily, but I did just find some nice reusable code for solving an annoying issue where the cameras randomly get assigned USB identifiers. Progress is being made, our hope is that staff will be using the system starting very soon and I'll post code up once we've done a few actual runs.
I didn't mean to denigrate Decapod, their goals are just really different- they're committed to straightening pages in new and interesting ways, and are working within an established code base/community; where my job is basically to get a bunch of existing text scanned ASAP, however quick/dirty the process may be. Turning it into something more is totally just because I'm excited about the possibilities, too...
I am really stoked to continue on with this project and move towards a LiveCD. I think as a format generally they're totally underutilized... back in '05 I was part of a group that was working on a LiveCD for webcasting/microradio; though we never made it past an early version:
http://www.auppix.org still has the background info, it's something I'd still like to revisit someday (though obviously some more sophisticated projects have emerged since then).
Dan- if you haven't spent much time playing with Linux recently, you're going to be pleasantly surprised. It's an actual grown-up OS now, rather than what it was just a couple of years ago. It's at least as intuitive and more accessible than Win or Mac except, unfortunately, for certain specialized applications (particularly those where one or two software companies dominate, like video or audio production). Oh, and I've been completely unsuccessful in getting a PVR system to work even a little bit (though watching TV is noprob).

The major issue, as you may have already discovered, is that while the tech is open, documentation is lacking throughout.
Hey
cathal_magus, can you PM me an email address?
Tim I'm looking forward to exploring CHDK/SDM options... if either is compatible with ghpoto2, this would open up both a ton of new control options AND mean a whole slew of new cameras are supported! If it's not yet compatible, perhaps this is something we as a community can help facilitate...
My posts do not represent opinions of my employer. Statements regarding the law should be construed only as information; for IP advice consult a lawyer, fnord. No attorney-client relationship has been created nor should be inferred.