PDF Plain Text Extractor

brett.shand · Feb 21, 2006

Foxit Reader. Click on Text Copy. Click the beginning of the
document, hold and wheel-scroll to the end. Click Edit/Copy. Not as
easy as running one program, but it only takes a few seconds to copy a
large document.

Actually you just have to use Cntrl-A to select all. But no formating
is preserved.

ms · Feb 21, 2006

Peter said:
ms - 21.02.2006 01:41 :

Susan, thanks for all the research. What a mess. I will try both of them, Foxit
was always a big dissapointment in this area.

Click to expand...

[~80 unnecessary quoting line snipped)]

"what a mess" of fullquoting again :-( Please learn to quote. Permanent
fullquoting is one of the most misbehavior in usenet. THX in advance for
your kind understanding.

You are a pest.

Why don't you notice that I NORMALLY snip, instead of one time I didn't remember to?

Mike Sa

Dan · Feb 21, 2006

Is there a prog that stiches together individual text files?

http://bluefive.pair.com/free95.htm
"TXTcollector grabs all .txt files from a directory of your choice and
combines them into a new (larger) text file. This can be handy: I use
TXTcollector myself to grab a CD full of readme's to enable me to read
them all in one go."

I haven't tried it. Looks promising for you.

Big Geeko · Feb 21, 2006

Dan pretended :

http://bluefive.pair.com/free95.htm
"TXTcollector grabs all .txt files from a directory of your choice
and combines them into a new (larger) text file. This can be handy: I
use TXTcollector myself to grab a CD full of readme's to enable me to
read them all in one go."

I haven't tried it. Looks promising for you.

xxxxxxxxxxxxxxxxxxxxxxxxhaxxxxxx
Hi
Tried and it joined the 540 files easily.
Each page had the file address and sequential file number on it. Tried
various text editors with a find and replace search, but no success.
Yes, I used wild card variable * for the different numbers, but no
success. Guess I will have to delete manually. Not complaining, it
has produced a fairly good result so far and cost nothing.
Thanks and regards
Jay

Al Klein · Feb 22, 2006

Yeah, the URL for the free version (I put it back) and *proof* that
these two apps are one and the same. In my *opinion* they are playing a
shell game - I strongly *suspect* they are simply pretending crippled
functions in the free app don't exist - "uncrippled" sounds so much
better than Crippleware. Then they tout the Trialware/payware app as if
it is a *different* app. FWIW - the downloads are close to the same size.

Susan, I can write 2 versions of a program, one with functions turned
off, with only a 1 BIT difference between them - if that bit is the
test bit for the turned-off functions. For instance, the Save
function starts:

IF <whatever the test condition for that bit is> then {
<do the save stuff>
}

The code is the same, whether the program is crippled or not - except
for the 1 bit the "condition" being tested for differs in. So the
length, even if it's identical to the byte, isn't an indicator of
"sameness".

(I can also write 2 versions of a program that do the same thing but
are different lengths.)

Al Klein · Feb 22, 2006

Actually you just have to use Cntrl-A to select all. But no formating
is preserved.

The original question was to extract the text - that implies no
formatting, since "text" doesn't have formatting, documents do.

Al Klein · Feb 22, 2006

ah well. . . one way to avoid the nightmare you're facing. . . if the
files are named in consecutive order TreePad (or KeyNote) can be used to
combine them.

Step 1. Open a new file in Treepad.
Step 2. Select all the text files and import them.
Step 3. Export the entire Treepad file as a single text file.

Note: The original file names will become node names in Treepad. Each
node name (aka file name) will be included above the text from the
original file when you export the Treepad file.

Susan, the old "copy" command can also combine files. While combining
540 files might wear one's nubs down, it can still be done. (A
program to do it is a good example of "trivial".)

Hengist_Ludd · Feb 22, 2006

Big Geeko,

Stitching PDFs together can easily be done with PDF Toolkit (PDFTK)
http://www.accesspdf.com/pdftk/

Hengist_Ludd · Feb 22, 2006

Oops, sorry; PDFTK that I mentioned in my previous post stitches together
*PDF* files, not text files. To combine text files, I'd probably use one
of the many ports of the Unix 'cat' utility. I think you can find one
such at http://unxutils.sourceforge.net

Susan Bugher · Feb 22, 2006

Al said:
Susan, I can write 2 versions of a program, one with functions turned
off, with only a 1 BIT difference between them - if that bit is the
test bit for the turned-off functions. For instance, the Save
function starts:

IF <whatever the test condition for that bit is> then {
<do the save stuff>
}

The code is the same, whether the program is crippled or not - except
for the 1 bit the "condition" being tested for differs in. So the
length, even if it's identical to the byte, isn't an indicator of
"sameness".

(I can also write 2 versions of a program that do the same thing but
are different lengths.)

<sheepish grin> Yeah, I know. Just doing a bit of ranting and raving.
I've been looking at too darned many sites that don't seem trustworthy -
sites that don't give people the information they need to make an
informed decision. . .

The author's claim that his app is "uncrippled" conveys no useful
information. He's saying he didn't program in a function and then
disable it. What does that tell me about the usefulness of the program?
Absolutely nothing. . . I'll take *honest* Liteware/Crippleware over
that any day. . . at least I'll know what I'm getting.

Susan
--
Posted to alt.comp.freeware
Search alt.comp.freeware (or read it online):
http://www.google.com/advanced_group_search?q=+group:alt.comp.freeware
Pricelessware & ACF: http://www.pricelesswarehome.org
Pricelessware: http://www.pricelessware.org (not maintained)

PDF Plain Text Extractor

brett.shand

ms

Dan

Big Geeko

Al Klein

Al Klein

Al Klein

Hengist_Ludd

Hengist_Ludd

Susan Bugher