Converting an entire webpage

  • Thread starter Thread starter Anonymous
  • Start date Start date
A

Anonymous

Hi Everyone,

You've seen those wizards like
http://www.dynamicdrive.com/emailriddler/ that convert text into
"ampersands plus a short series of numbers." I think it's called "hash"
or something like that.

Is there a tool that can convert an entire webpage into that "hash
code" or whatever it's called?

I have a large list of contacts and their address/telephone data and
email addresses and I want to convert the entire page into "hash" so
that bots don't come and feast on the data.

Any ideas would be helpful, and thanks in advance.
 
Hi Everyone,

You've seen those wizards like
http://www.dynamicdrive.com/emailriddler/ that convert text into
"ampersands plus a short series of numbers." I think it's called "hash"
or something like that.

Is there a tool that can convert an entire webpage into that "hash
code" or whatever it's called?

I have a large list of contacts and their address/telephone data and
email addresses and I want to convert the entire page into "hash" so
that bots don't come and feast on the data.

Any ideas would be helpful, and thanks in advance.

There is a prm called "eText" which will kinda do whjat you want. It's
basically an encryption pgm. You paste any text into it, click encrypt,
and it produces nothing but hash that you can insert ina message, a web
page, whatever. For anyone else to see what it is they need to have the
eText exe on their syste,. They copy and paste it into eText, click, and
there's the unencrypted text.
 
There is a prm called "eText" which will kinda do whjat you want. It's
basically an encryption pgm. You paste any text into it, click encrypt,
and it produces nothing but hash that you can insert ina message, a web
page, whatever. For anyone else to see what it is they need to have the
eText exe on their syste,. They copy and paste it into eText, click, and
there's the unencrypted text.

Thanks Ben, but that isn't going to work in this case. I need the page
to appear on the net without eText being installed on the audience's
computers.
 
Thanks Ben, but that isn't going to work in this case. I need the page
to appear on the net without eText being installed on the audience's
computers.

Okay, but that conflicts with what you asked - that the page be turned into
garbage so a spider can't read it.

The page will still be there with eText - but a spider CANNOT read anything
from it. One only needs eText to actually see the text properly.
 
Anonymous said:
Hi Everyone,
....
I have a large list of contacts and their address/telephone data and
email addresses and I want to convert the entire page into "hash" so
that bots don't come and feast on the data.

Any ideas would be helpful, and thanks in advance.

Hiya anonymous. Instead of "hashing" the pages, have you thought of
setting up a robot.txt file? This handles the crawler bots from most
(if not all) major web indexers (google, yahoo, msn, etc).

http://www.robotstxt.org/wc/norobots.html

Other than that, have you considered putting password protection on the
page in question?

hth,
-Craig
 
Thanks Ben, but that isn't going to work in this case. I need the page
to appear on the net without eText being installed on the audience's
computers.

But it's not just changed to numbers. It's changed to javascript code
[document.write(some script including the "encrypted" email address)]
if you want it to show on the page. In order to do what you want
you'd have to turn that entire part of your page into document write
lines - and your users would have to have javascripting enabled,
something a lot of people won't do these days. (You could do it in
vbscript too, but the same problem, since script runs client-side.)
But to have a program somehow format the page the way you want it
formatted? I doubt it. It would have to be done manually - pretty
much by generating the script that page generates, then inserting the
code - manually - where you wanted it. For each name, address, phone
# - whatever you wanted hidden from spiders. The code to generate the
script is simple, even to be able to feed it a page of text and
getting a page of script - writing a program to make a web page out of
that script automatically isn't.
 
I have a large list of contacts and their address/telephone data and
email addresses and I want to convert the entire page into "hash" so
that bots don't come and feast on the data.

Any ideas would be helpful, and thanks in advance.

Thinking outside the square here.

Why not do a screen grab of the list and save it as an image. Put the image
on the website page without any alt text, page text or keywords.
 
Hiya anonymous. Instead of "hashing" the pages, have you thought of setting
up a robot.txt file? This handles the crawler bots from most (if not all)
major web indexers (google, yahoo, msn, etc).

http://www.robotstxt.org/wc/norobots.html

Other than that, have you considered putting password protection on the page
in question?

hth,
-Craig

You're gonna love this: YES! I DID suggest passwording the page---but
the list is a political organization and they want their email
addessess available to the public but not to the spammers-bots. They
also don't know a whole lot about building webpages either...
 
Anonymous said:
Thinking outside the square here.

Why not do a screen grab of the list and save it as an image. Put the image
on the website page without any alt text, page text or keywords.

I can make a PDF out of it or save it as a jpg---but---this is a
political organization and they want people to be able to click on
their email addresses. (They don't know a whole lot about building
webpages...)
 
Anonymous said:
Is there a tool that can convert an entire webpage into that "hash
code" or whatever it's called?

The entire page can not be encoded.

The tags need be in HTML.

In other words...

This:

<html>hi</html>

encoded as:

<html>hi</html>

Will not work.

This will work:

I have a large list of contacts and their address/telephone data and
email addresses and I want to convert the entire page into "hash" so
that bots don't come and feast on the data.

You need to encode only the data you wish to obscure.

A lot of cut and paste or a fancy Perl script.
 
Back
Top