How to identify OCR-converted Word documents?

Guest

Hi

Can anyone tell me which technical points I should look for in a Word document that has been created by an OCR application? I know, for example, that many OCR applications insert hidden symbols such as optional hyphens, em and en dashes, and other symbols that are not visible in normal view but appear when the Show/Hide button is activated. Are there any other hidden or technical features that are embedded in converted documents? I would greatly appreciate any help in this regard. Thanks.

-Anu
 
Graham Mayor

Convert it to plain text and there won't be any formatting horrors. OCR
applications generally make a dog's breakfast of formatting Word documents -
arguably the best is FineReader, certainly from version 5 on. I don't know
of any instance where 'hidden symbols' have been inserted other than as a
side effect of the OCR application making a cock-up of the conversion.
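
For anyone who wants to check the plain-text version for the characters mentioned in the question, here is a minimal Python sketch. It is only an illustration: the character list and the name count_suspect_chars are assumptions made for this example, and it assumes the optional hyphens survive the plain-text export as U+00AD, which may depend on the Word version and the encoding chosen when saving. A high count does not prove a document came from OCR; it only shows where to look.

```python
# Characters mentioned in the question, plus a few related invisible ones.
# This list is an illustrative assumption, not a definitive OCR signature.
SUSPECT_CHARS = {
    "\u00ad": "soft (optional) hyphen",
    "\u2011": "non-breaking hyphen",
    "\u2013": "en dash",
    "\u2014": "em dash",
    "\u00a0": "no-break space",
    "\u200b": "zero-width space",
}


def count_suspect_chars(text):
    """Return {description: count} for each suspect character found in text."""
    counts = {}
    for ch in text:
        if ch in SUSPECT_CHARS:
            label = SUSPECT_CHARS[ch]
            counts[label] = counts.get(label, 0) + 1
    return counts


# Hypothetical sample standing in for text exported from a converted document.
sample = "con\u00adverted docu\u00adment \u2013 scanned\u00a0page"
print(count_suspect_chars(sample))
```

To use it on a real file, read the exported text with open(path, encoding="utf-8").read() and pass the result to count_suspect_chars; the encoding is likewise an assumption and should match whatever was chosen on export.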

--
<>>< ><<> ><<> <>>< ><<> <>>< <>><
Graham Mayor - Word MVP

My web site www.gmayor.com
Word MVP web site www.mvps.org/word
<>>< ><<> ><<> <>>< ><<> <>>< <>><
 
