OCR-ed text in TIF files is not indexed?

  • Thread starter: Jerry

Jerry

I have a large document archive consisting of scanned files in TIF format,
where all text has been OCR-ed. With Windows XP I was using the Windows
Desktop Search ver. 2.6 that was neatly handling the OCR-ed text.

After updating to Vista, that is no longer the case, in spite of my having
selected the "index properties and file contents" option for TIF files. Is
there a way I can get this to work?
 
wait one moment...

If you have OCR'ed the TIF images, then you have produced text files with
the writing from those images in some text format like txt, doc or rtf.

Those text files can be indexed.

If you want Vista to READ the TIF IMAGES and understand the writing on
them, you are out of luck! lol, that's not possible.
 
The text in the image has to be converted to actual text to be indexed.
 
The text is stored in the same TIF file. The OCR was done using the Document
Imaging program (part of MS Office).
WDS 2.6 had no problem indexing it.
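
One way to check that a scan really carries more than pixel data is to list the tags stored in the file. Below is a minimal sketch, assuming a classic (non-BigTIFF) file. The function name list_tiff_tags is made up for illustration, and which tag Document Imaging uses for the OCR text is not stated in this thread, so the sketch only reports the tag IDs it finds on each page.

```python
import struct


def list_tiff_tags(data):
    """Return one list of tag IDs per image file directory (IFD) in a classic TIFF."""
    # Bytes 0-1 give the byte order: "II" is little-endian, "MM" is big-endian.
    order = data[:2]
    if order == b"II":
        endian = "<"
    elif order == b"MM":
        endian = ">"
    else:
        raise ValueError("not a TIFF: bad byte-order mark")

    # Bytes 2-3 hold the magic number 42, bytes 4-7 the offset of the first IFD.
    magic, offset = struct.unpack(endian + "HI", data[2:8])
    if magic != 42:
        raise ValueError("not a classic TIFF: magic number is not 42")

    pages = []
    seen = set()  # guards against a corrupt file whose IFD chain loops
    while offset != 0 and offset not in seen:
        seen.add(offset)
        # An IFD is a 2-byte entry count, then 12-byte entries, then a 4-byte
        # offset to the next IFD (0 means there are no more pages).
        (count,) = struct.unpack_from(endian + "H", data, offset)
        tags = []
        for i in range(count):
            (tag,) = struct.unpack_from(endian + "H", data, offset + 2 + 12 * i)
            tags.append(tag)
        pages.append(tags)
        (offset,) = struct.unpack_from(endian + "I", data, offset + 2 + 12 * count)
    return pages
```

Reading a file in binary mode and passing its bytes to list_tiff_tags gives the tag numbers per page. Anything beyond the baseline image tags (for example 270, ImageDescription, which holds ASCII text) suggests the file carries extra data alongside the image.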
 
Jerry,

A TIFF file is normally only an image file...

However, I know that you can add a layer of text and annotations on it,
because you can do that on XP with the document fax and imaging viewer.

However, if you view that TIFF image with another viewer like IrfanView, the
annotations are not viewable...

So I don't know what to advise you... perhaps MS thought it was no longer a
standard, and removed that capability from the indexing?
 
Text in the TIF format *is* part of the standard. Incidentally, the spec
fathers were Microsoft and Aldus (now part of Adobe).
MS Office comes with MODI iFilters for TIF and MDI formats, precisely in
order to allow indexing of everything, metadata/text included.
I suspect something must be wrong with some obscure setting buried deep in
the Registry :-)
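
For anyone who wants to look, here is a rough sketch of how the filter registered for an extension can be traced. It assumes the usual persistent-handler layout under HKEY_CLASSES_ROOT; Windows Search on Vista may consult other settings that this does not cover. The names find_ifilter and read_hkcr_default are illustrative only, and the registry reader is passed in as a parameter so the lookup logic can be tried without touching a real registry.

```python
# Interface ID of IFilter, used as a subkey name under PersistentAddinsRegistered.
IFILTER_IID = "{89BCB740-6119-101A-BCB7-00DD010655AF}"


def find_ifilter(ext, read_default):
    """Follow extension -> persistent handler -> IFilter CLSID -> DLL path.

    read_default(subkey) must return the default value of the given
    HKEY_CLASSES_ROOT subkey as a string, or None if it does not exist.
    Returns (clsid, dll_path), or None if no filter is registered.
    """
    handler = read_default(ext + "\\PersistentHandler")
    if handler is None:
        return None
    clsid = read_default(
        "CLSID\\" + handler + "\\PersistentAddinsRegistered\\" + IFILTER_IID
    )
    if clsid is None:
        return None
    dll_path = read_default("CLSID\\" + clsid + "\\InprocServer32")
    return clsid, dll_path


def read_hkcr_default(subkey):
    """Registry-backed reader for find_ifilter (Windows only)."""
    import winreg  # imported here so the module still loads on other systems

    try:
        with winreg.OpenKey(winreg.HKEY_CLASSES_ROOT, subkey) as key:
            # An empty value name selects the key's default value.
            return winreg.QueryValueEx(key, "")[0]
    except OSError:
        return None
```

On the Vista machine, find_ifilter(".tif", read_hkcr_default) should show whether any filter is registered for TIF at all and which DLL it points to. A result of None, or a DLL that is not the Office one, would be a place to start looking.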
 
Do you have Office installed now?



 
Jerry said:
...the spec fathers were Microsoft and Aldus (now part of Adobe).

I read about the TIFF file specs on Wikipedia after you asked... so I know
the story now.
 
kirk jim wrote:


I read about the TIFF file specs on Wikipedia after you asked... so I
know the story now.

You read about something on Wikipedia?
Shit...that explains everything.
Never mind.
Frank
 
Frank, you are free to edit the articles and make them better if you
have more complete knowledge...
 