REQ: CHM files

S

Susan Bugher

omega said:
I don't really follow, but anyway, will mention one item. You can start
from a full mirror of the PW site, thus all your links will be relative
in that mirror. And then you can copy the individual directories from
there, for whichever purpose(s) you envision.

Using this? BUILD user defined string = &p/%n.%t

Susan
 
S

Susan Bugher

omega said:
An advantage, that you can approach it from both ends.
Or a disadvantage: more available work.

True (but I do have to tidy up some of the pages either way).

Susan
 
O

omega

Susan Bugher said:
Using this? BUILD user defined string = &p/%n.%t

Yes, all settings the same, except for the Start URL, and the Scan Rules.

You could use that 2005.whtt and its subdirectory that was in my zip.
Copy it somewhere. Delete the hts-cache files (in case necessary),
while preserving the winprofile.ini. That copied *.whtt file can be
renamed, so long as you do the same rename for its copied httrack
subfolder.

Same objective as above should be achievable by export/import of an .opt
file (note -it's important what screen of httrack you're in during that
export). Just this second method I've less experience with, over the first.

Start URL. http://www.pricelesswarehome.org/ or to the index there, or
whichever you see fit.

Then on the Filters, wipe out everything that's there.

You could consider these two killfilters.
-*.zip -*.chm

Although if you download those particular files once, and leave them in
your local archive, httrack won't retrieve them again, until it reads
that their size/time stamps have changed.

There is one thing, however, that might come up, which I haven't addressed.
It concerns the matter of getting some files that aren't meant for end-user
browse (the special footer headers bodies files). I'd have to do the
retrieval project described above to see it there's a knot there, which
might then mean a further settings tweak would be needed.
 
O

omega

Susan Bugher said:
Hadn't thought of trying that. not right now though. . .
Thanks. :)

Just /watching/ how much work you take on, it tires me out so much,
I have to go take a nap.
 
S

Susan Bugher

omega said:
There is one thing, however, that might come up, which I haven't addressed.
It concerns the matter of getting some files that aren't meant for end-user
browse (the special footer headers bodies files). I'd have to do the
retrieval project described above to see it there's a knot there, which
might then mean a further settings tweak would be needed.

The problem child is the 2004 directory.

2003: get the .htm and .php files (plus images if you want them)

2004: get the .php files and *only* the listed .htm files - I posted
those (or see the PL index)

2005: get just the .php files (plus images etc.)

acf: get just the .php files (plus images etc.)


Susan
 

Ask a Question

Want to reply to this thread or ask your own question?

You'll need to choose a username for the site, which only take a couple of moments. After that, you can post your question and our members will help you out.

Ask a Question

Top