OCR Suggestions

B

Big Johnny

Can anyone suggest the best freeware OCR program -- preferably one that can
work with both my scanner and a graphic or pdf file?

Or if anyone knows of a list comparing the different freeware OCR programs,
that would be great.

Thanks for your help...

Big J
 
J

Johnny

Can anyone suggest the best freeware OCR program -- preferably one that can
work with both my scanner and a graphic or pdf file?

Or if anyone knows of a list comparing the different freeware OCR programs,
that would be great.

Thanks for your help...

Big J

Free OCR softwares:

GOCR/JOCR v0.40 - 194 KB
GOCR/JOCR is an OCR (Optical Character Recognition) program, developed
under the GNU Public License. Joerg Schulenburg started the program,
and now leads a team of developers.
GOCR can be used with different front-ends, which makes it very easy
to
port to different OSes and architectures. It can open many different
image formats, and its quality have been improving in a daily basis.
The original name is GOCR. It's what is used internally in the
sources.
But, when registering the site at Sourceforge, gocr was already taken.
So, it's kind of both. Yeah, we know.
http://jocr.sourceforge.net/scr_option.gif
Jörg Schulenburg
(e-mail address removed)-magdeburg.de
http://jocr.sourceforge.net/index.html
http://www-e.uni-magdeburg.de/jschulen/ocr/index.html
http://www-e.uni-magdeburg.de/jschulen/ocr/gocr040exe.zip


Graph OCR - 145 KB
A program for getting numeric data from scaned graphics.
http://www.moskvin.biz/ss/lineocr.gif
D. B. Moskvin
(e-mail address removed)
http://www.moskvin.biz/attic.php
http://www.moskvin.biz/free/grocr.zip


OmniFormat v7.5 - 5319 KB
OmniFormat is a free document conversion utility which allows dynamic
conversion and image manipulation of over 75 file formats including
HTML, DOC, XLS, WPD, PDF, XML, JPG, GIF, TIF, PNG, PCX, PPT, PS, TXT,
Photo CD, FAX and MPEG. OmniFormat supports Optical Character
Recognition (OCR) and may also be used to convert images and documents
to rights managed PDF files.
Omniformat requires that Pdf995 - also FREE - be installed. Pdf995 is
the fast, affordable way to create professional-quality documents in
the popular PDF file format. Its easy-to-use interface allows you to
create PDF files by simply selecting the "print" command from any
application, creating documents which can be viewed on any computer
with a PDF viewer.
The OmniFormat OCR Module enables OmniFormat to automatically convert
scanned images to text when the TXT output format is selected in
OmniFormat. The OCR Module will process all import formats handled by
OmniFormat. It can also extract text from PDF files and be run from
the
command line.
We support Windows 95, 98, 2000 and Me, NT 4.0 and XP.
http://www.pctipp.ch/library/graphics/categories/downloads/dl/24948_2.JPG
Software995
(e-mail address removed)
http://www.omniformat.com/
http://www.freeware995.com/omniformat/omniformat.exe
OCR Module v2.5 - 520 KB:
http://www.freeware995.com/omniformat/ocrmodule.exe


SimpleOCR v3.1 - 9511 KB
Do you dread having to retype that document you are holding in your
hand? If only you had the electronic file, your life would be so much
easier. With SimpleOCR, you could easily and accurately convert that
paper document into editable electronic text for use in any
application
including Word and WordPerfect.
Not only is SimpleOCR up to 99% accurate, it is 100% free.
Features:
- Huge Dictionary - With more than 120,000 words, it is unlikely that
SimpleOCR will run into a word it does not know. In the rare event
that it does not, our improved text editor allows you to easily add
the
new word to the dictionary. By adding new words to the dictionary,
SimpleOCR becomes better with every use.
- Despeckle - For those documents which are not particularly clear
(i.e. faxes, copies of copies, ...), SimpleOCR provides a despeckle or
"noisy document" option which increases SimpleOCR's accuracy.
- Format Retention - SimpleOCR can keep certain elements of the
document's format in the recognized document. From varying font sizes
to font formatting elements such as underline, italic, and bold,
SimpleOCR recognizes it all. For certain documents, it retains the
original document's format with up to 99% accuracy.
- Image Retention - Along with the document's text, SimpleOCR has the
uncanny ability to capture and retain pictures from the document. This
is a great feature which reduces the need to import images from a
document by other means.
- Plain Text Extraction - Just need the plain text from the original
document? No problem. SimpleOCR can be set to recognize the
characters and words but ignore the formatting. The resulting file is
ready for your word processor or your HTML/web editor and your own
custom formatting.
- Simplified Error Correction - Our text editor highlights suspected
errors in the recognized text for easier correction. This simplifies
the otherwise time-consuming task of proof reading the recognized text
for errors. But because SimpleOCR has up to 99% accuracy, you may
never need this feature.
- Batch OCR - Do you have several documents to OCR? Just point
SimpleOCR to them and it will OCR them from start to finish without
delay.
- Zone OCR - Sometimes all you may need is to extract the text from a
certain area in a document. Maybe one column. Maybe a footnote.
Maybe just one paragraph. Unlike other OCR applications, SimpleOCR
can
limits its OCR ability to a user defined area. There is no need to
OCR
an entire document only to use a small portion of it. With SimpleOCR,
OCR only what you need.
- Input Formats - SimpleOCR works with all fully compliant TWAIN
scanners and also accepts input from TIFF files.
- Output Formats - SimpleOCR can save the documents it acquires in
text
formats (TXT and RTF) importable into most every program such as Word,
WordPerfect, HTML editors, and e-mail programs, either fully formatted
or as plain text. Additionally, it can save scanned documents in the
industry standard TIFF format, a format as widely accepted as PDF
files.
- Multiple Language Recognition - SimpleOCR currently supports English
and French recognition. We are in the process of adding recognition
for additional languages.
- System Requirements SimpleOCR works on any PC with either Windows
95, 98, NT4, 2000, or XP. Your scanner need only a TWAIN driver, the
driver that comes with a majority of all scanners sold. In short,
SimpleOCR will most likely work with the PC and scanner you already
have.
- Pricing Our software is free for all non-commercial uses.
http://www.simpleocr.com/images/ScreenShot.gif
ScanStore.com
(e-mail address removed)
http://www.simpleocr.com/
http://www.charactell.com/scanstore/InstSocr.exe


Transcript v2.1.1.28 - 1204 KB
Transcript is a program I made to simplify the transcription of old
genealogical and historical documents but it can be used to transcribe
all digital documents which cannot be transcribed using OCR.
You could for example use it to transcribe the text from an old
newspaper, your diary, handwritten notes or any other documents you
have.
I found it very inconvenient that I always needed to switch between my
editor and image viewer to move the digital photograph of my document
to the part I needed to transcribe.previous or the next image.
Besides this, there are many other options to manipulate text and
image. It also remembers the position where you were working the last
time and will go back there when starting up the next time.
The editor uses the standard "rich text" format which is supported by
most editors so exchanging documents will be easy.
Transcript is free for private use only.
http://home.wanadoo.nl/jgboerema/en/Images/Transcript.jpg
J.G. Boerema
(e-mail address removed)
http://home.wanadoo.nl/jgboerema/en/Freeware.htm#Trans
http://home.wanadoo.nl/jgboerema/Downloads/Installer_Transcript2.1.exe
plugin - 342+440 KB:
http://home.wanadoo.nl/jgboerema/Downloads/ExifViewer.zip
http://home.wanadoo.nl/jgboerema/Downloads/ImageViewerPlugin.zip


WOCAR v2.5 - 1987 KB
WOCAR is an Optical Characters Recognition Application (OCR). It
converts scanned documents to text documents. The software can process
documents written in english or in french. WOCAR can work with any
scanner that supports the TWAIN interface. I can also process any
bilevel TIFF image file. This application works on Windows 95 and
Windows NT.
Special requirements: A TWAIN scanner is useful.
Note: Newest version's name: SimpleOCR
Cyrill Cambien
(e-mail address removed)
http://www.simtel.net/product.php[url_fb_product_page]28825
ftp://ftp.forthnet.gr/pub/simtelnet/win95/grafmisc/wocar250.zip
_
Johnny from hell: http://camorraecamorristi.napolionline.org
_
Johnny from hell: http://camorraecamorristi.napolionline.org
 
S

socrtwo

I like the online service provided by the National Library of Medicine
at NIH. The service is called DocMorph
(http://docmorph.nlm.nih.gov/docmorph/synthesizedinstructions.htm).

I also use the OCR feature in my MS Office 2003 Tools menu in the
program called Microsoft Office Document Imaging. It's a big
improvement over the XP version of the same applet. You get OCR text
by copying and pasting in images with text using the "Paste Page"
choice from the Page Menu. After that you simply choose "Send text to
Word" from the Tools Menu. You can also open certain varieties of Tiff
files to do the same thing. If you know what this means (and
admittedly I kinda don't too) the variety of TIFF file that always
seems to work is the uncompressed version (choose this variety if you
are using IrfanView to prepare the Tiffs).

I think DocMorph is still better.

By the way, nice work by Johnny's freeware listing.
 
L

lisztfr

My conviction is that there is no good OCR as freeware, for
what platform ever. For linux there is a small tool, 100-300ko,
CLI.
Simple Ocr and Wocar are just jokes, i tried to get them but
can't remember what happens, anyway.... If it would not be
jokes, you could post less text for describing them :)
Get a scanner, usually a OCR is shipped with. Alternatives
solutions exists, but can't be legal of course.

laurent
 

Ask a Question

Want to reply to this thread or ask your own question?

You'll need to choose a username for the site, which only take a couple of moments. After that, you can post your question and our members will help you out.

Ask a Question

Top