<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>cssOrigins.com &#187; Googlebot</title>
	<atom:link href="http://www.cssOrigins.com/tag/googlebot/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.cssOrigins.com</link>
	<description>Css Origins, Design Blog and Wordpress Publishing Platform</description>
	<lastBuildDate>Mon, 12 Jul 2010 06:35:11 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=abc</generator>
		<item>
		<title>A Deeper Look At Robots.txt</title>
		<link>http://www.cssOrigins.com/2009/a-deeper-look-at-robotstxt/</link>
		<comments>http://www.cssOrigins.com/2009/a-deeper-look-at-robotstxt/#comments</comments>
		<pubDate>Fri, 17 Apr 2009 23:02:36 +0000</pubDate>
		<dc:creator>Hunter Brelsford</dc:creator>
				<category><![CDATA[Archive]]></category>
		<category><![CDATA[News Press]]></category>
		<category><![CDATA[Googlebot]]></category>
		<category><![CDATA[Robots.txt]]></category>
		<category><![CDATA[SEO]]></category>

		<guid isPermaLink="false">http://www.cssOrigins.com/?p=211</guid>
		<description><![CDATA[Robots.txt syntax User-Agent: the robot the following rule applies to (e.g. “Googlebot,” etc.) Disallow: the pages you want to block the bots from accessing (as many disallow lines as needed) Noindex: the pages you want a search engine to block AND not index (or de-index if previously indexed). Unofficially supported by Google; unsupported by Yahoo [...]


No related posts.]]></description>
			<content:encoded><![CDATA[<p><strong>Robots.txt syntax</strong></p>
<ul>
<li>User-Agent: the robot the following rule applies to (e.g. “Googlebot,” etc.)</li>
<li>Disallow: the pages you want to block the bots from accessing (as many disallow lines as needed)</li>
<li>Noindex: the pages you want a search engine to block AND not index (or de-index if previously indexed). Unofficially supported by Google; unsupported by Yahoo and Live Search.</li>
<li>Each User-Agent/Disallow group should be separated by a blank line; however no blank lines should exist within a group (between the User-agent line and the last Disallow).</li>
<li>The hash symbol (#) may be used for comments within a robots.txt file, where everything after # on that line will be ignored. May be used either for whole lines or end of lines.</li>
<li>Directories and filenames are case-sensitive: “private”, “Private”, and “PRIVATE” are all uniquely different to search engines.</li>
</ul>
<p>Let’s look at an example robots.txt file. The example below includes:</p>
<ul>
<li>The robot called “Googlebot” has nothing disallowed and may go anywhere</li>
<li>The entire site is closed off to the robot called “msnbot”;</li>
<li>All robots (other than Googlebot) should not visit the /tmp/ directory or directories or files called /logs, as explained with comments, e.g., tmp.htm, /logs or logs.php.</li>
</ul>
<p><code>User-agent: Googlebot Disallow: </code></p>
<p><code>User-agent: msnbot Disallow: / </code></p>
<p><code># Block all robots from tmp and logs directories User-agent: * Disallow: /tmp/ Disallow: /logs # for directories and files called logs </code></p>
<p>via <a href="http://searchengineland.com/a-deeper-look-at-robotstxt-17573">A Deeper Look At Robots.txt</a> .</p>


<p>No related posts.</p>]]></content:encoded>
			<wfw:commentRss>http://www.cssOrigins.com/2009/a-deeper-look-at-robotstxt/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
