what is the best format for scanned documents


komar77r

that contain text and a lot of pictures?
i heard about a format that can automatically choose the
best algorithm for each part of the image: text is
compressed as grayscale in a way similar to gif, pictures
are compressed in a way similar to jpeg, and all of this is
contained in one file

i'm especially interested in these formats that are supported
by IrfanView:
ECW - Enhanced Compressed Wavelet
JP2 - JPEG 2000
JPM
LDF - Lura Document Format
LWF - Lura Wave Format

so... what is the best?
[[ i don't need a lossless format, i simply want to pack as much
data as possible with a high compression rate, regardless
of time and processing power ]]
 
Why not use TIFF with fax Group 4 compression?
It is more or less the industry standard :-)
 
but TIFFs are (as far as i know) veeeery huge...

No. TIFF is just a name for a way to write files.

TIFF files can be uncompressed and huge, or well compressed, as with fax
Group 4: A4 or letter-size business documents average about 35 KB at
300 dpi B/W.
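
To put the 35 KB figure in perspective, here is a rough back-of-the-envelope check of how big the same page would be with no compression at all. The A4 dimensions in inches are approximations, the function name is just for illustration, and treating "35 KB" as 35 * 1024 bytes is an assumption.

```python
# Rough size of a 1-bit (B/W) A4 page at 300 dpi with no compression.
A4_WIDTH_IN = 8.27    # approximate A4 width in inches
A4_HEIGHT_IN = 11.69  # approximate A4 height in inches
DPI = 300


def uncompressed_bilevel_bytes(width_in, height_in, dpi):
    """Bytes for a 1-bit-per-pixel page, each row padded to a whole byte."""
    width_px = round(width_in * dpi)
    height_px = round(height_in * dpi)
    bytes_per_row = (width_px + 7) // 8
    return bytes_per_row * height_px


raw = uncompressed_bilevel_bytes(A4_WIDTH_IN, A4_HEIGHT_IN, DPI)
g4 = 35 * 1024  # the quoted Group 4 average, assumed to mean 35 * 1024 bytes

print(raw, "bytes uncompressed")
print(f"roughly {raw / g4:.0f}:1 compression with Group 4")
```

So an uncompressed B/W page is a bit over 1 MB, and Group 4 brings a typical text page down by a factor of about thirty. Pages with photos or halftones will do much worse, since Group 4 only handles 1-bit data.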
 
What you are looking for is a format that saves text in binary format
(1-bit) and graphics in 24-bit color, all in the same file. This is called
"mixed raster content".

This is supported in PDF files using a method called "Adaptive Compression".

It is also supported by DjVu and in JPM files. JPM is "JPEG 2000 - Part 6".
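
The mixed raster content idea described above can be sketched in a few lines. This is only a toy illustration, not how any of these formats is actually implemented: the luminance threshold and the function names are made up for the example, and a real encoder would go on to compress the 1-bit mask with a bilevel coder and the two color layers with an image coder.

```python
# Toy mixed-raster-content split: 1-bit mask + foreground + background.
# Threshold and helper names are illustrative assumptions, not from any format.

def luminance(rgb):
    """Integer approximation of perceived brightness, 0..255."""
    r, g, b = rgb
    return (299 * r + 587 * g + 114 * b) // 1000


def split_mrc(pixels, threshold=64):
    """Split rows of RGB tuples into (mask, foreground, background).

    Dark pixels are treated as text: mask bit 1, color kept in the
    foreground layer. Everything else stays in the background layer.
    """
    mask, fg, bg = [], [], []
    for row in pixels:
        mask_row = [1 if luminance(p) < threshold else 0 for p in row]
        mask.append(mask_row)
        fg.append([p if m else None for p, m in zip(row, mask_row)])
        bg.append([None if m else p for p, m in zip(row, mask_row)])
    return mask, fg, bg


def compose(mask, fg, bg):
    """Rebuild the page: take foreground where the mask is 1, else background."""
    return [
        [f if m else b for m, f, b in zip(mrow, frow, brow)]
        for mrow, frow, brow in zip(mask, fg, bg)
    ]


page = [
    [(255, 255, 255), (0, 0, 0), (200, 120, 40)],
    [(10, 10, 10), (250, 250, 250), (30, 90, 200)],
]
mask, fg, bg = split_mrc(page)
restored = compose(mask, fg, bg)
print(mask)
print(restored == page)
```

The point of the split is that each layer can then be stored with the method that suits it: the mask keeps text edges sharp at full resolution, while the color layers can be compressed lossily and often at a lower resolution.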

Check out these webpages:
http://www.planetdjvu.com/djvu_vs__pdf_vs__jpeg2000_part_6__jpm_.htm
http://www.planetdjvu.com/pdf_adaptive_compression_edges_close_to_djvu.htm
http://www.searchpdf.com/presentation_of_new_adobe_acrobat_6_compression_methods_t.htm
http://www.planetdjvu.com/the_mrc__mixed_raster_content__model_and_djvu.htm

It's been a while since I looked into this, so I'm sure there have been
new developments.
 
