Webmaster Key - Discussion Forums


Topic: Robots.txt Disallow Urls
White Wolf
Key Keeper
Full Member
***
Posts: 180



« on: October 23, 2007, 10:26:42 PM »

My site uses some kind of redirect.

The URLs look like "www.mydomain.com/?page=apply&id=XXXX", where XXXX is the item ID number.

My question: is there a way to disallow just /?page=apply&id=XXXX with one line, or do I need to add each URL?

User-agent: *
Disallow:  /?page=apply&id=1234
Disallow:  /?page=apply&id=1235
Disallow:  /?page=apply&id=1236


   

Andy
Administrator
Veteran
*****
Posts: 5 752



« Reply #1 on: October 24, 2007, 09:59:51 AM »

You only need part of the URL, e.g.

Disallow:  /?page=apply


From http://www.robotstxt.org/wc/norobots.html :
Quote
Disallow
    The value of this field specifies a partial URL that is not to be visited. This can be a full path, or a partial path; any URL that starts with this value will not be retrieved. For example, Disallow: /help disallows both /help.html and /help/index.html, whereas Disallow: /help/ would disallow /help/index.html but allow /help.html
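
The prefix rule quoted above can be illustrated with a few lines of Python. This is only a sketch of the matching described in the quote, not the code of any particular crawler; the helper name is_disallowed is made up for this example.

```python
def is_disallowed(path, disallow_rules):
    """Return True if path starts with any non-empty Disallow value.

    Hypothetical helper: it models only the plain prefix matching
    described in the quoted text, nothing crawler-specific.
    """
    return any(rule and path.startswith(rule) for rule in disallow_rules)


# The two cases from the quoted text.
print(is_disallowed("/help.html", ["/help"]))         # True
print(is_disallowed("/help/index.html", ["/help"]))   # True
print(is_disallowed("/help.html", ["/help/"]))        # False
print(is_disallowed("/help/index.html", ["/help/"]))  # True

# The single rule suggested above covers every id value.
print(is_disallowed("/?page=apply&id=1234", ["/?page=apply"]))  # True
print(is_disallowed("/?page=about", ["/?page=apply"]))          # False
```

Because the comparison is a plain "starts with" test, one Disallow line for /?page=apply matches every URL that continues with &id=... after it.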

White Wolf
Key Keeper
Full Member
***
Posts: 180



« Reply #2 on: October 25, 2007, 12:35:09 AM »

Thanks for your reply, Andy.

I'm tired of having to add to robots.txt every time I add a card. ;)

SensoVision
Administrator
Veteran
*****
Posts: 5 857


« Reply #3 on: October 25, 2007, 09:15:11 PM »

Kevin, to make sure you've blocked the right pages, you can use a tool that comes with Google Webmaster Central: http://www.google.com/webmasters/sitemaps/siteoverview
Once Googlebot has spidered your pages, you can check whether particular URLs are accessible to the spider. There are other useful tools available there too, so it's worth registering.
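
For a quick local check before a crawler visits, a rough sketch using Python's standard urllib.robotparser module might look like the following. It does not replace the Google tool, since each crawler applies its own matching rules and this only shows how Python's parser reads the file. The domain is the placeholder from the first post, and the helper name can_crawl is made up for this example.

```python
from urllib.robotparser import RobotFileParser

# The rule set discussed in this thread.
ROBOTS_TXT = """\
User-agent: *
Disallow: /?page=apply
"""


def can_crawl(robots_txt, url, user_agent="*"):
    """Parse robots.txt text and report whether user_agent may fetch url.

    Hypothetical helper built on the standard-library parser.
    """
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


# www.mydomain.com is a placeholder, not a real site to test against.
for url in (
    "http://www.mydomain.com/?page=apply&id=1234",
    "http://www.mydomain.com/?page=apply&id=1235",
    "http://www.mydomain.com/",
):
    status = "allowed" if can_crawl(ROBOTS_TXT, url) else "blocked"
    print(url, status)
```

To test a live site instead of a pasted rule set, RobotFileParser also offers set_url() and read(), which download the robots.txt file before can_fetch() is called.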

Denis
White Wolf
Key Keeper
Full Member
***
Posts: 180



« Reply #4 on: October 25, 2007, 09:39:18 PM »

Thanks, Denis!

I tried out the tool, and Andy's suggestion does work correctly.
