TIFF compression options

Scan Tailor specific announcements, releases, workflows, tips, etc. NO FEATURE REQUESTS IN THIS FORUM, please.

TIFF compression options

Postby eL_PuSHeR » 14 Aug 2010, 05:49

Do ScanTailor just supports LZW compression in TIFF? What about others (e.g. ZIP)?

Not a big deal and working with LZW equals less compatibility issues. But I am just curious...
eL_PuSHeR
 
Posts: 123
Joined: 28 Jun 2010, 15:25

Re: TIFF compression options

Postby Tulon » 14 Aug 2010, 06:04

I always try to minimize the number of preferences, therefore I choose to have just one output format. When deciding on the format, the following issues were taken into account:
* Can't be lossy, so no JPEG.
* LizardTech's DjVu encoders didn't support PNG or TIFF Deflate (aka ZIP), so those are out too.
* TIFF G4 would be good for B/W output, but Photoshop gets its aspect ratio wrong.

So, TIFF LZW was the only option that doesn't have compatibility issues. Fortunately, by the time I started developing Scan Tailor, the patent on LZW compression had already expired.
When Scan Tailor asks you to enter DPIs manually, never enter arbitrary values. The video tutorial shows how to estimate the real DPI.
Tulon
 
Posts: 536
Joined: 03 Oct 2009, 06:13
Location: London, UK

Re: TIFF compression options

Postby eL_PuSHeR » 15 Aug 2010, 04:40

It seems reasonable. Thank you very much for your reply.
eL_PuSHeR
 
Posts: 123
Joined: 28 Jun 2010, 15:25

Re: TIFF compression options

Postby knappen » 30 Aug 2010, 15:03

A very, very elementary question:

Is TIFF more effectively compressed in a PDF conversion than JPG? I have tended to go through all the color and greyscale images and saved them as JPG as I thought this would result in a smaller end product before making the PDF with Adobe Pro, but suddenly I'm not sure that this is the way to proceed after all ...
knappen
 
Posts: 35
Joined: 29 Jul 2010, 20:21

Re: TIFF compression options

Postby Misty » 30 Aug 2010, 16:02

No, compressing the TIFFs to another format before using Acrobat Pro won't improve your compression. PDF uses several different compression methods, including JPEG and the superior JPEG2000, so it's actually better if you haven't used a lossy compression format before sending it to Acrobat. JPEG is also a poor format for Scan Tailor text compression, because it's pure black and white. Acrobat will select a better compression automatically.

If your TIFFs have both illustrations and text on the same page, you'll get the best compression by using a script that can separate layers like Strider1551's djvubind for DjVu, or my own PDF maker script that I'll be releasing in the near future.
The opinions expressed in this post are my own and do not necessarily represent those of the Canadian Museum for Human Rights.
User avatar
Misty
 
Posts: 481
Joined: 06 Nov 2009, 12:20
Location: Frozen Wasteland

Re: TIFF compression options

Postby spamsickle » 30 Aug 2010, 16:44

Is your PDF maker going to generate separate text and image layers? I assume you won't be doing real text with a font (so I still don't need to bother with an OCR step), but I also assume a text layer could use run length encoding or some other efficient compression on the binary portion of the image.
spamsickle
 
Posts: 577
Joined: 06 Jun 2009, 23:57

Re: TIFF compression options

Postby Misty » 30 Aug 2010, 17:07

That's right. It separates each page into two layers, one binarized text and one colour/greyscale image. The text is the scan as produced by Scan Tailor, not real text with a font. The text is compressed using lossy JBIG2, which is really efficient for pure black/white images like text. I'm using JPEG2000 for images. Text is used at full-res, but I'm offering the option of selecting a separate DPI for the text so you can get smaller PDFs by using screen-size downsampled images - 100DPI by default.

The first version won't include OCR; that will need to be done using Acrobat as a separate step by the user. I'm looking at the possibility of adding Tesseract to do OCR in a later version.
The opinions expressed in this post are my own and do not necessarily represent those of the Canadian Museum for Human Rights.
User avatar
Misty
 
Posts: 481
Joined: 06 Nov 2009, 12:20
Location: Frozen Wasteland

Re: TIFF compression options

Postby knappen » 30 Aug 2010, 17:13

Thanks, misty. I guess I have wasted time sorting out and converting illustration images that would have been better left alone.

Looking forward to the script.

Does anyone have a hot tip on a guide to getting a good view of different image formats and their respective use for cases like this?
knappen
 
Posts: 35
Joined: 29 Jul 2010, 20:21

Re: TIFF compression options

Postby Misty » 31 Aug 2010, 09:07

A guide like that would be handy! I'm not sure if there is one. I'm planning to do a little glossary on the different file/compression formats for my blog after I finish the post I'm putting together on camera selection.
The opinions expressed in this post are my own and do not necessarily represent those of the Canadian Museum for Human Rights.
User avatar
Misty
 
Posts: 481
Joined: 06 Nov 2009, 12:20
Location: Frozen Wasteland


Return to Scan Tailor

Who is online

Users browsing this forum: No registered users and 0 guests