Ezilon Directory  Submit Articles
 Author Login


Community News & Articles 
 
 World News
 Africa
 Asia
 Australia
 Central America
 Europe
 Middle East
 New Zealand
 North America
 South America
 United Kingdom
 India
 Caribbean
 Ireland
 
 Sports News
 Basketball
 Football
 Soccer
 Others
 Golfing
 Hunting
 
 Entertainment
 Movies
 Music
 Television
 Games
 
 Internet Articles
 Internet Design Articles
 Internet Marketing Tips
 Search Engine Help
 
 Fashion Articles and News
 Women Fashion
 Men's Fashion
 
 Health Articles and News
 Health and Beauty
 Diseases
 
 Weight Loss / Management
 
 Social and Cultural Issues
 Wedding
 Dating
 Relationships
 
 Women Issues and Articles
 
 Business and Industry
 Real Estate Properties
 Travel and Holidays
 Insurance
 Loans
 Stock and Trading
 Investing
 Legal
 
 Science & Technology
 Telephony and Voip
 MP3 and iPod
 Conferencing Calling
 
 Environment
 
 Finance and Business
 
 Home & Family
 Food and Cooking
 Crafts
 Decorations
 
 United Nation
 
 Men Issues
Search

Internet Articles : Internet Design Articles Last Updated: May 9th, 2011 - 08:37:04


The proper way to use the robot.txt file
By Jimmy Whisenhunt
Feb 6, 2005, 15:51

Email this article
When optimizing your web site most webmasters don’t consider using the robot.txt file. This is a very important file for your site. It let the spiders and crawlers know what they can and can not index. This is helpful in keeping them out of folders that you do not want index like the admin or stats folder or content that they can not index.

Here is a list of variables that you can include in a robot.txt file and there meaning:

1)User-agent: In this field you can specify a specific robot to describe access policy for or a “*” for all robots more explained in example.
2)Disallow: In the field you specify the files and folders not to include in the crawl.
3)# the number sign represents comments

Here are some examples of a robot.txt file for redball.com

User-agent: *
Disallow:

The above would let all spiders index all content.

Here another

User-agent: *
Disallow: /cgi-bin/

The above would block all spiders from indexing the cgi-bin directory.

User-agent: googlebot
Disallow:

User-agent: *
Disallow: /admin.php
Disallow: /cgi-bin/
Disallow: /admin/
Disallow: /stats/

In the above example googlebot can index everything while all other spiders can not index admin.php, cgi-bin, admin, and stats directory. Notice that you can block single files like admin.php.




About the Author
Jimmy Whisenhunt is the owner of VIP Enterprises

          
Internet Design Articles
Latest Headlines
» Website design easier than ever
» How To Create and Conduct Internet Online Business For Profits
» How To Use RSS Feeds To Share News and Content
» Headers And Footers To Make Your Web Page Look Dynamic
» How to create a web page using HTML For Beginners
» The Top 20 Things You Can Do to Make Your Website Accessible
» The Psychology of Colors in Website Design
» Most Professional Commercial Sites Now Using PHP
» Is Your Website Designed to Sell? Graphics and Colors Effects
» How To Redirect Your URL Using META Tag