O
omega
[...]
I'm going to snip everything for this initial reply. In order to more
narrowly focus on describing the nature of my new uploads.
1. http://www.redshift.com/~omega/pw/pw2005nominations.zip (700k)
This is a direct httrack retrieval of the 2005 directory. I've made no
alterations, left it a hybrid of offline-online. That is, it contains
locally all the PW2005 program description pages, and similar content.
Then it also contains links that go online, to other parts of the PW
site.
I will be using the archive above during the next couple of weeks, as
a guide when mediating on my voting decisions. And maybe also as a ref
when I might want to engage in whichever topics might arise during the
post-voting discussion.
I think there would be a couple of other folks around, dialup users,
who might find it convenient to have the web archive contents of the
file above, stored locally on disk, for the period of the next couple
of weeks. I can keep it updated every 2 days or so, if it's the case
that anyone else has use for it.
2. http://www.redshift.com/~omega/pw/pw2003-04.zip (1.6mb)
This is an offline web archive of PL 2003 and PL 2004. They are separate
folders. I just zipped them together for convenience.
Susan: I was not up to starting over. On the two archives above, I just
took what I had already. The only change I made was to rename to your
suggestion. And, mainly, to fix the internal links. (As a workaround for
the problem that arose from getting the mixed file pairs after your bot
and mine had the odd result from their interaction.) These are offline
files. They also contain title-tags for the php generated pages, such
that all pages within would be conducive to providing unique entries
for CHM TOCs / HTML Indexes.
It might be that you are more interested in a more purely unaltered
mirror of the 2003 and 2004 directories? That's easy business for the
full site. It's when it concerns an offline archive for a particular part
of the site, then the end-form sought is a matter to dwell upon. I find
it confusing, from the perspective of using the file myself, to have the
PL links sometimes point online, and sometimes offline (thus my choice
on these to cut some headers and footers). OTOH, maybe those details
are unimportant for whichever other general purposes the archives would
serve.
Alternate plans, and uses, re archiving the PW 2003 and 2004 archives:
It's something to talk about. Just, as far as my doing that particular
project again immediately, I'd rather not, prefer take a nice long rest
first...
3. Your bot, and mine, and how they interact.
I appreciate the information you've provided, about how those pages get put
together. I'm pretty slow, as well as having zero experience or knowledge
with using a web server script. Although I do have some feeling on what's
goes on, due to your explanations. In conjunction with my observing the
httrack logs, and the content of the files. (Also, I did today take
advantage, and retrieve full directory lists, when you opened it up by
removing index.html files from the roots).
I thought it might turn out that you could want to check out my bot
first-hand?
You, or someone interested in getting involved, to help with the general
project of making chapters of the PWH site available in offline forms.
If you (or someone who has interest in helping) installs Httrack, and then
uses the zip file below I've created, that will provide things set to go for
a full mirror of the site.
http://www.redshift.com/~omega/pw/HttrackProjects.zip (2k)
Most of Httrack's settings work out of the box, but there are several
specific settings that are pertinent to a good mirroring of the PW site.
Originally I was going to post those briefly here. But this post has gone
already into what I'm guessing is the 100+ line range, so it's time to give
a break to attention span.
Btw, I'm hoping that I haven't made a fatal error here, that this doesn't
inspire 200 lurkers to go tromp across your server. I'd thought it over,
thought that took a while, but my best estimate ended that it would be okay
to talk in specifics about sending bots upon the site...
I'm going to snip everything for this initial reply. In order to more
narrowly focus on describing the nature of my new uploads.
1. http://www.redshift.com/~omega/pw/pw2005nominations.zip (700k)
This is a direct httrack retrieval of the 2005 directory. I've made no
alterations, left it a hybrid of offline-online. That is, it contains
locally all the PW2005 program description pages, and similar content.
Then it also contains links that go online, to other parts of the PW
site.
I will be using the archive above during the next couple of weeks, as
a guide when mediating on my voting decisions. And maybe also as a ref
when I might want to engage in whichever topics might arise during the
post-voting discussion.
I think there would be a couple of other folks around, dialup users,
who might find it convenient to have the web archive contents of the
file above, stored locally on disk, for the period of the next couple
of weeks. I can keep it updated every 2 days or so, if it's the case
that anyone else has use for it.
2. http://www.redshift.com/~omega/pw/pw2003-04.zip (1.6mb)
This is an offline web archive of PL 2003 and PL 2004. They are separate
folders. I just zipped them together for convenience.
Susan: I was not up to starting over. On the two archives above, I just
took what I had already. The only change I made was to rename to your
suggestion. And, mainly, to fix the internal links. (As a workaround for
the problem that arose from getting the mixed file pairs after your bot
and mine had the odd result from their interaction.) These are offline
files. They also contain title-tags for the php generated pages, such
that all pages within would be conducive to providing unique entries
for CHM TOCs / HTML Indexes.
It might be that you are more interested in a more purely unaltered
mirror of the 2003 and 2004 directories? That's easy business for the
full site. It's when it concerns an offline archive for a particular part
of the site, then the end-form sought is a matter to dwell upon. I find
it confusing, from the perspective of using the file myself, to have the
PL links sometimes point online, and sometimes offline (thus my choice
on these to cut some headers and footers). OTOH, maybe those details
are unimportant for whichever other general purposes the archives would
serve.
Alternate plans, and uses, re archiving the PW 2003 and 2004 archives:
It's something to talk about. Just, as far as my doing that particular
project again immediately, I'd rather not, prefer take a nice long rest
first...
3. Your bot, and mine, and how they interact.
I appreciate the information you've provided, about how those pages get put
together. I'm pretty slow, as well as having zero experience or knowledge
with using a web server script. Although I do have some feeling on what's
goes on, due to your explanations. In conjunction with my observing the
httrack logs, and the content of the files. (Also, I did today take
advantage, and retrieve full directory lists, when you opened it up by
removing index.html files from the roots).
I thought it might turn out that you could want to check out my bot
first-hand?
You, or someone interested in getting involved, to help with the general
project of making chapters of the PWH site available in offline forms.
If you (or someone who has interest in helping) installs Httrack, and then
uses the zip file below I've created, that will provide things set to go for
a full mirror of the site.
http://www.redshift.com/~omega/pw/HttrackProjects.zip (2k)
Most of Httrack's settings work out of the box, but there are several
specific settings that are pertinent to a good mirroring of the PW site.
Originally I was going to post those briefly here. But this post has gone
already into what I'm guessing is the 100+ line range, so it's time to give
a break to attention span.
Btw, I'm hoping that I haven't made a fatal error here, that this doesn't
inspire 200 lurkers to go tromp across your server. I'd thought it over,
thought that took a while, but my best estimate ended that it would be okay
to talk in specifics about sending bots upon the site...