PC Review


Reply
Thread Tools Rate Thread

/robots.txt at end of URL?

 
 
Les Caudle
Guest
Posts: n/a
 
      20th Jul 2007
I'm noticing that web requrests are coming in with /robots.txt appended at the
end:

http://www.domain.com/ProductDetails...527/robots.txt

I can correct these, one by one for each page, but I'd like to find a way to
have ASP.NET 2.,0 strip this invalid /robots.txt off the end of any URL for me.

Is this possible?
--
Thanks in advance, Les Caudle
 
Reply With Quote
 
 
 
 
George Ter-Saakov
Guest
Posts: n/a
 
      20th Jul 2007
Let me asking why do you want to strip it?
robors.txt is asked by automatic robots that collect information. Like
google or yaho.

But in your case apparently it's some lame robot that can not parse out url
correctly simply ads /robot.txt to the end of url and asks for it from the
server.
So why would you worry what kind of garbage it will get in return from your
page.
All important robots you should be worring about like Google or Yahoo parse
out url correctly.

So i am saying just ignore it. It's not human.

George.



"Les Caudle" <(E-Mail Removed)> wrote in message
news(E-Mail Removed)...
> I'm noticing that web requrests are coming in with /robots.txt appended at
> the
> end:
>
> http://www.domain.com/ProductDetails...527/robots.txt
>
> I can correct these, one by one for each page, but I'd like to find a way
> to
> have ASP.NET 2.,0 strip this invalid /robots.txt off the end of any URL
> for me.
>
> Is this possible?
> --
> Thanks in advance, Les Caudle



 
Reply With Quote
 
 
 
 
Les Caudle
Guest
Posts: n/a
 
      20th Jul 2007
Well, it creates an event in my event log that would distract me from real
events caused by users who had problems.

Would be nice to be able to globally deal with it.

Regards, Les Caudle

On Fri, 20 Jul 2007 10:28:48 -0400, "George Ter-Saakov" <gt-(E-Mail Removed)>
wrote:

>Let me asking why do you want to strip it?
>robors.txt is asked by automatic robots that collect information. Like
>google or yaho.
>
>But in your case apparently it's some lame robot that can not parse out url
>correctly simply ads /robot.txt to the end of url and asks for it from the
>server.
>So why would you worry what kind of garbage it will get in return from your
>page.
>All important robots you should be worring about like Google or Yahoo parse
>out url correctly.
>
>So i am saying just ignore it. It's not human.
>
>George.
>
>
>
>"Les Caudle" <(E-Mail Removed)> wrote in message
>news(E-Mail Removed)...
>> I'm noticing that web requrests are coming in with /robots.txt appended at
>> the
>> end:
>>
>> http://www.domain.com/ProductDetails...527/robots.txt
>>
>> I can correct these, one by one for each page, but I'd like to find a way
>> to
>> have ASP.NET 2.,0 strip this invalid /robots.txt off the end of any URL
>> for me.
>>
>> Is this possible?
>> --
>> Thanks in advance, Les Caudle

>


 
Reply With Quote
 
George Ter-Saakov
Guest
Posts: n/a
 
      20th Jul 2007
Well, the problem is that this time it's robot.txt. Next time (with another
bad robot) it will be something else.
You can not fix it for every bad robot that out there.
Override Application_OnError and send an email to yourself every time
something bad happened. Then you can use Email's rules to filter out the
most annoying ones

George.


"Les Caudle" <(E-Mail Removed)> wrote in message
news:(E-Mail Removed)...
> Well, it creates an event in my event log that would distract me from real
> events caused by users who had problems.
>
> Would be nice to be able to globally deal with it.
>
> Regards, Les Caudle
>
> On Fri, 20 Jul 2007 10:28:48 -0400, "George Ter-Saakov"
> <gt-(E-Mail Removed)>
> wrote:
>
>>Let me asking why do you want to strip it?
>>robors.txt is asked by automatic robots that collect information. Like
>>google or yaho.
>>
>>But in your case apparently it's some lame robot that can not parse out
>>url
>>correctly simply ads /robot.txt to the end of url and asks for it from the
>>server.
>>So why would you worry what kind of garbage it will get in return from
>>your
>>page.
>>All important robots you should be worring about like Google or Yahoo
>>parse
>>out url correctly.
>>
>>So i am saying just ignore it. It's not human.
>>
>>George.
>>
>>
>>
>>"Les Caudle" <(E-Mail Removed)> wrote in message
>>news(E-Mail Removed)...
>>> I'm noticing that web requrests are coming in with /robots.txt appended
>>> at
>>> the
>>> end:
>>>
>>> http://www.domain.com/ProductDetails...527/robots.txt
>>>
>>> I can correct these, one by one for each page, but I'd like to find a
>>> way
>>> to
>>> have ASP.NET 2.,0 strip this invalid /robots.txt off the end of any URL
>>> for me.
>>>
>>> Is this possible?
>>> --
>>> Thanks in advance, Les Caudle

>>

>



 
Reply With Quote
 
Walter Wang [MSFT]
Guest
Posts: n/a
 
      22nd Jul 2007
Hi Les,

I agree with George, as long as your web site can be accessed publicly, you
cannot guarantee every URL request is valid or in expected manner.


Regards,
Walter Wang ((E-Mail Removed), remove 'online.')
Microsoft Online Community Support

==================================================
When responding to posts, please "Reply to Group" via your newsreader so
that others may learn and benefit from your issue.
==================================================

This posting is provided "AS IS" with no warranties, and confers no rights.

 
Reply With Quote
 
 
 
Reply

Thread Tools
Rate This Thread
Rate This Thread:

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off
Trackbacks are On
Pingbacks are On
Refbacks are Off


Similar Threads
Thread Thread Starter Forum Replies Last Post
Qury sub-forms breaks front-end to back-end when imprting from new front-end jphelan1242 via AccessMonster.com Microsoft Access Queries 0 17th May 2006 06:41 PM
in OWA, the End key should go to end of page, not end of inbox =?Utf-8?B?TmVhbCBXcmlnaHQ=?= Microsoft Outlook Discussion 0 5th Aug 2005 03:36 AM
When a program is not responding, "END NOW" does not end program. =?Utf-8?B?cGNjaGVt?= Windows XP General 1 4th Feb 2005 12:35 AM
End to end test performance =?Utf-8?B?Sm9obm55?= Windows XP General 4 30th Nov 2004 09:27 PM
HELP linking two routers end to end King David Windows XP Networking 0 22nd Oct 2003 05:11 PM


Features
 

Advertising
 

Newsgroups
 


All times are GMT +1. The time now is 09:29 PM.