Chris
Is there a program to convert .pdf to .txt?
wald said: Xpdf has a "pdftotext" tool that does just that, alongside
"pdfimages", "pdftops" and "pdffonts". It's open source and does the
job perfectly.
http://www.foolabs.com/xpdf/download.html
Regards,
Wald
I just did a Google search; give this one a try. It's a demo.
http://www.verypdf.com/pdf2txt/pdf2txt.htm
John Corliss said: Unfortunately, it works no better than copying text in Acrobat
Reader then pasting it into Wordpad, or simply doing "File",
"Save as Text..." in that it loses spaces at the beginning of a
line as well as double hard returns.
wald said: Well, in case you want to preserve formatting as much as possible,
it's probably better to use something like pdftohtml
(http://pdftohtml.sourceforge.net/), which converts PDF to... euh,
well, HTML!
CharlieDontSurf said: Easy PDF to Text Converter
Easy PDF to Text Converter is freeware. It works well on Windows
98/ME/2000/NT/XP Platform.
Features of Easy PDF to Text Converter
* Supports PDF to text file conversion
* Converts batches of PDF files to text files at one time
* Processes the conversion at very high speed
* Does NOT need Adobe Acrobat software
* Keeps the original page layout when converting PDF to text
* Supports drag and drop of files
* Supports the PDF 1.5 format (formerly supported only by Acrobat 6.0)
* Works well on Win98/ME/NT/2000/XP platforms
* User-friendly interface and easy to use!
http://www.pdf-to-html-word.com/pdf-to-text/
CharlieDontSurf,
I downloaded and installed this program. It's nice and the install is
fairly clean, but when I converted one .pdf document it lost a lot of
spaces between words. It didn't do this in all files that I converted
however.
Also, in any multipage .pdf file that I tried to convert to text, it
saved each page as a separate text file. Appending those pages to each
other is a real pain. Still, it comes closest to anything I've seen in
this thread to keeping original page layout.
--
Regards from John Corliss
I don't reply to trolls. No adware, cdware, commercial software,
crippleware, demoware, nagware, PROmotionware, shareware, spyware,
time-limited software, trialware, viruses or warez please.
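For the one-file-per-page problem described above, a short script can stitch the pages back together. This is only a sketch under assumptions: the thread does not say how the converter names its per-page files, so the code assumes each file name contains its page number (for example the hypothetical `report_page3.txt`) and that the files are UTF-8 text. The names `page_number`, `merge_pages` and `merged.txt` are made up for illustration.

```python
import re
from pathlib import Path


def page_number(path):
    """Return the last run of digits in the file name (without extension).

    Assumption: that number is the page number. Files with no digits get 0.
    """
    digits = re.findall(r"\d+", path.stem)
    return int(digits[-1]) if digits else 0


def merge_pages(folder, output_name="merged.txt", separator="\n"):
    """Join every .txt file in `folder` into one file, in page order."""
    folder = Path(folder)
    out_path = folder / output_name
    # Sort numerically so that page 10 comes after page 2, not after page 1.
    pages = sorted(
        (p for p in folder.glob("*.txt") if p.name != output_name),
        key=page_number,
    )
    texts = [p.read_text(encoding="utf-8", errors="replace") for p in pages]
    out_path.write_text(separator.join(texts), encoding="utf-8")
    return out_path
```

Calling `merge_pages("C:/converted/report")` on a folder of page files would write `merged.txt` next to them. A plain alphabetical sort would put page 10 before page 2, which is why the sort key pulls out the number.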
wald said: Just to be sure... have you looked at the available options
(pdftotext -h)? The -layout option looks like it might improve the
results for you, although I haven't tested it.
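For anyone who wants to try the -layout suggestion on several files, here is a minimal sketch that wraps the call in a script. It assumes the Xpdf `pdftotext` binary is on the PATH and uses the usual `pdftotext [options] input.pdf output.txt` form; the helper names `build_pdftotext_command` and `convert` are invented for this example, and whether -layout actually fixes the lost leading spaces is untested here as well.

```python
import shutil
import subprocess
from pathlib import Path


def build_pdftotext_command(pdf_path, txt_path=None, layout=True):
    """Build the argument list for a pdftotext call.

    If no output path is given, use the PDF's name with a .txt extension.
    """
    cmd = ["pdftotext"]
    if layout:
        # -layout asks pdftotext to keep the physical layout of the page.
        cmd.append("-layout")
    cmd.append(str(pdf_path))
    if txt_path is None:
        txt_path = Path(pdf_path).with_suffix(".txt")
    cmd.append(str(txt_path))
    return cmd


def convert(pdf_path, txt_path=None, layout=True):
    """Run pdftotext and return the path of the text file it wrote."""
    if shutil.which("pdftotext") is None:
        raise FileNotFoundError("pdftotext was not found on the PATH")
    cmd = build_pdftotext_command(pdf_path, txt_path, layout)
    subprocess.run(cmd, check=True)  # raises CalledProcessError on failure
    return Path(cmd[-1])
```

Running `convert("manual.pdf")` once and `convert("manual.pdf", "manual_plain.txt", layout=False)` once gives two text files that can be compared side by side to see whether the option helps with a particular document.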
Michael said: John Corliss wrote, twice: ...
I don't know if you've noticed, but your latest postings appear twice.