OCR-ed text in TIF files is not indexed?

J

Jerry

I have a large document archive consisting of scanned files in TIF format,
where all text has been OCR-ed. With Windows XP I was using the Windows
Desktop Search ver. 2.6 that was neatly handling the OCR-ed text.

After updating to Vista, that is no longer the case, in spite of my having
selected "index properties and file contents" option for TIF files. Is there
a way I can get this to work?
 
K

kirk jim

wait one moment...

if you have OCR'ed the TIF images.. then you have produced text files with
the writting from those images in some text format like txt, doc or rtf.

Those text files can be indexed.

If you want vista to READ the TIF IMAGES and understand the writting on
them, you are out of luck! lol thats not possible
 
A

Andre Da Costa[ActiveWin]

The text in the image has to be converted to actual text to be indexed.
 
J

Jerry

The text is stored in the same TIF file. Done using the Document Imaging
program (part of MS Office).
WDS 2.6 had no problem indexing it.
 
K

kirk jim

Jerry,

a tiff file is normally only an image file...

however I know that you can add a layer of text on it and annotations,
because you can do that on XP with the document fax and imaging viewer.

However if you view that tiff image with another viewer like irfanview the
annotations are not viewable....

So I dont know what to advise you... perhaps MS thought it was no longer a
standard , and removed that capability from the indexing ?
 
J

Jerry

Text in the TIF format *is* part of the standard. Incidentally, the spec
fathers were Microsoft and Aldus (now part of Adobe).
MS Office comes with MODI iFilters for TIF and MDI formats, precisely in
order to allow indexing of everything, metadata/text included.
I suspect something must be wrong with some obscure setting buried deep in
the Registry :)
 
K

kirk jim

do you have office installed now?



Jerry said:
Text in the TIF format *is* part of the standard. Incidentally, the spec
fathers were Microsoft and Aldus (now part of Adobe).
MS Office comes with MODI iFilters for TIF and MDI formats, precisely in
order to allow indexing of everything, metadata/text included.
I suspect something must be wrong with some obscure setting buried deep in
the Registry :)
 
K

kirk jim

fathers were Microsoft and Aldus (now part of Adobe).

I read about the tiff file specs on wikipedia after you asked.... so I know
the story now
 
F

Frank

kirk jim wrote:


I read about the tiff file specs on wikipedia after you asked.... so I
know the story now

You read about something on wikipedia?
Shit...that explains everything.
Never mind.
Frank
 
K

kirk jim

Frank you are free to edit the articles and make them better if you
have more complete knowledge....
 

Ask a Question

Want to reply to this thread or ask your own question?

You'll need to choose a username for the site, which only take a couple of moments. After that, you can post your question and our members will help you out.

Ask a Question

Top