C
Chris
Is there a program to convert .pdf to .txt?
Chris said:Is there a program to convert .pdf to .txt?
wald said:XPdf has a "pdftotext" tool that does just that, besides
"pdfimages", "pdftops" and "pdffonts". It's open source, does the
job perfectly.
http://www.foolabs.com/xpdf/download.html
Regards,
Wald
just did a google search and give this one a try. its demo
http://www.verypdf.com/pdf2txt/pdf2txt.htm
wald said:XPdf has a "pdftotext" tool that does just that, besides
"pdfimages", "pdftops" and "pdffonts". It's open source, does the
job perfectly.
http://www.foolabs.com/xpdf/download.html
Is there a program to convert .pdf to .txt?
John Corliss said:Unfortunately, it works no better than copying text in Acrobat
Reader then pasting it into Wordpad, or simply doing "File",
"Save as Text..." in that it loses spaces at the beginning of a
line as well as double hard returns.

wald said:Well, in case you want to preserve formatting as much as possible,
it's probably better to use something like pdftohtml
(http://pdftohtml.sourceforge.net/), which converts PDF to... euh,
well, HTML![]()
CharlieDontSurf said:Easy PDF to Text Converter
Easy PDF to Text Converter is freeware. It works well on Windows
98/ME/2000/NT/XP Platform.
Features of Easy PDF to Text Converter
* Supports PDF to Text file conversion
* Convert batches of PDF files to Text files at one time
* Processes the conversion with very high speed
* Does NOT need Adobe Acrobat software
* Keeps original page layout when convert pdf to text
* Support drag and drop files
* Support PDF1.5 protocol (formerly only supported by Acrobat6.0)
* Works well on Win98/ME/NT/2000/XP platforms
* Userfriendly interface and easy to use!
http://www.pdf-to-html-word.com/pdf-to-text/
CharlieDontSurf said:Easy PDF to Text Converter
Easy PDF to Text Converter is freeware. It works well on Windows
98/ME/2000/NT/XP Platform.
Features of Easy PDF to Text Converter
* Supports PDF to Text file conversion
* Convert batches of PDF files to Text files at one time
* Processes the conversion with very high speed
* Does NOT need Adobe Acrobat software
* Keeps original page layout when convert pdf to text
* Support drag and drop files
* Support PDF1.5 protocol (formerly only supported by Acrobat6.0)
* Works well on Win98/ME/NT/2000/XP platforms
* Userfriendly interface and easy to use!
http://www.pdf-to-html-word.com/pdf-to-text/
wald said:Well, in case you want to preserve formatting as much as possible,
it's probably better to use something like pdftohtml
(http://pdftohtml.sourceforge.net/), which converts PDF to... euh,
well, HTML![]()
CharlieDontSurf,
I downloaded and installed this program. It's nice and the install is
fairly clean, but when I converted one .pdf document it lost a lot of
spaces between words. It didn't do this in all files that I converted
however.
Also, in any multipage .pdf file that I tried to convert to text, it
saved each page as a separate text file. Appending those pages to each
other is a real pain. Still, it comes closest to anything I've seen in
this thread to keeping original page layout.
--
Regards from John Corliss
I don't reply to trolls. No adware, cdware, commercial software,
crippleware, demoware, nagware, PROmotionware, shareware, spyware,
time-limited software, trialware, viruses or warez please.
John Corliss said:Unfortunately, it works no better than copying text in Acrobat
Reader then pasting it into Wordpad, or simply doing "File",
"Save as Text..." in that it loses spaces at the beginning of a
line as well as double hard returns.
wald said:Just to be sure... have you looked at the available options
(pdftotext -h)? The -layout option looks like it might improve the
results for you, although I haven't tested it.
Michael said:John Corliss wrote, twice: ...
I don't know if you've noticed, but your latest postings appear twice.
[email protected] said:CharlieDontSurf,
I downloaded and installed this program. It's nice and the install is
fairly clean, but when I converted one .pdf document it lost a lot of
spaces between words. It didn't do this in all files that I converted
however.
Also, in any multipage .pdf file that I tried to convert to text, it
saved each page as a separate text file. Appending those pages to each
other is a real pain. Still, it comes closest to anything I've seen in
this thread to keeping original page layout.
Want to reply to this thread or ask your own question?
You'll need to choose a username for the site, which only take a couple of moments. After that, you can post your question and our members will help you out.