PageTraffic SEO Blog

Control The Indexing And Accessing Of Your Sites By Search Engines

January 27th, 2007




A post on the Google blog explains how web publishers can control the way search engines, including Google itself, access and index their sites. The most important tool for this is the robots.txt file, which lets site owners tell crawlers which parts of a site to stay out of. The post reads: “you may have a few pages on your site you don't want in Google's index. For example, you might have a directory that contains internal logs, or you may have news articles that require payment to access. You can exclude pages from Google's crawler by creating a text file called robots.txt and placing it in the root directory. The robots.txt file contains a list of the pages that search engines shouldn't access. Creating a robots.txt is straightforward and it allows you a sophisticated level of control over how search engines can access your web site.”
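To make this concrete, here is a minimal sketch of what such a file might look like and how its rules can be checked with Python's standard `urllib.robotparser` module. The directory names `/logs/` and `/paid-articles/` and the `www.example.com` host are assumptions made up for illustration; they stand in for the internal logs and paid news articles mentioned in the quote and do not come from the Google post.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt content. The two paths are illustrative
# assumptions, not rules taken from the Google post.
ROBOTS_TXT = """\
User-agent: *
Disallow: /logs/
Disallow: /paid-articles/
"""


def build_parser(robots_txt: str) -> RobotFileParser:
    """Parse robots.txt text held in memory instead of fetching it."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)

# Ask whether a crawler identifying itself as "Googlebot" may fetch each URL.
blocked = parser.can_fetch("Googlebot", "http://www.example.com/logs/access.log")
allowed = parser.can_fetch("Googlebot", "http://www.example.com/index.html")
print(blocked, allowed)
```

On a live site the same rules would be saved as a plain text file named robots.txt in the root directory, as the quote describes. Well-behaved crawlers read that file before requesting other pages.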

Besides the robots.txt file, there is the robots META tag, which gives you finer control over individual pages. You add specific META tags to an HTML page, and they control how that individual page is indexed.
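As a rough illustration, the sketch below shows a page carrying a robots META tag and a small helper that reads the directives out of it using Python's standard `html.parser` module. The sample page, the class name `RobotsMetaParser`, and the helper `robots_directives` are hypothetical and exist only for this example. The values `noindex` and `nofollow` are two commonly used directives, chosen here as an assumption about what a publisher might want.

```python
from html.parser import HTMLParser

# Hypothetical page: the META tag asks crawlers not to index this page
# and not to follow the links on it.
PAGE = """\
<html>
  <head>
    <title>Members only</title>
    <meta name="robots" content="noindex, nofollow">
  </head>
  <body>Private content</body>
</html>
"""


class RobotsMetaParser(HTMLParser):
    """Collects directives from <meta name="robots" ...> tags."""

    def __init__(self):
        super().__init__()
        self.directives = set()

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        # Attribute values can be None when an attribute has no value.
        attr_map = {name: (value or "") for name, value in attrs}
        if attr_map.get("name", "").lower() != "robots":
            return
        for directive in attr_map.get("content", "").split(","):
            directive = directive.strip().lower()
            if directive:
                self.directives.add(directive)


def robots_directives(html: str) -> set:
    """Return the set of robots META directives found in an HTML string."""
    parser = RobotsMetaParser()
    parser.feed(html)
    return parser.directives


print(sorted(robots_directives(PAGE)))
```

Unlike robots.txt, which applies to paths across the whole site, the tag travels with the page itself, so each page can carry its own instructions.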


Copyright © 2006-2008 PageTraffic SEO Blog. All rights reserved.
