<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>googlebot Archives - Sky&#039;s Blog</title>
	<atom:link href="https://blog.red7.com/tag/googlebot/feed/" rel="self" type="application/rss+xml" />
	<link>https://blog.red7.com/tag/googlebot/</link>
	<description>Communicating in a networked world</description>
	<lastBuildDate>Thu, 04 Dec 2014 23:35:29 +0000</lastBuildDate>
	<language>en-US</language>
	<sy:updatePeriod>
	hourly	</sy:updatePeriod>
	<sy:updateFrequency>
	1	</sy:updateFrequency>
	<generator>https://wordpress.org/?v=6.9.4</generator>

<image>
	<url>https://blog.red7.com/wp-content/uploads/2018/01/skyhi-wind-icon-256x256-120x120.png</url>
	<title>googlebot Archives - Sky&#039;s Blog</title>
	<link>https://blog.red7.com/tag/googlebot/</link>
	<width>32</width>
	<height>32</height>
</image> 
	<item>
		<title>Even Robots.txt won&#8217;t keep the googlebot away</title>
		<link>https://blog.red7.com/robots-txt-googlebot/</link>
					<comments>https://blog.red7.com/robots-txt-googlebot/#respond</comments>
		
		<dc:creator><![CDATA[sky]]></dc:creator>
		<pubDate>Tue, 06 Nov 2012 18:23:54 +0000</pubDate>
				<category><![CDATA[Blogging]]></category>
		<category><![CDATA[Organizations and Sociology]]></category>
		<category><![CDATA[Our networked world]]></category>
		<category><![CDATA[Security]]></category>
		<category><![CDATA[Social tools]]></category>
		<category><![CDATA[Technology and geeky stuff]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[googlebot]]></category>
		<category><![CDATA[search]]></category>
		<guid isPermaLink="false">http://blog.red7.com/?p=3538</guid>

					<description><![CDATA[<p>Well am I ever surprised! I would have thought that inserting a robots.txt file that tells googlebot to &#8220;go away&#8221; would cause it to &#8220;not index the site.&#8221; User-agent: * Disallow: / Instead, I discovered that the googlebot may still spot the site and then put up a message saying that the site exists but [&#8230;]</p>
<p>The post <a href="https://blog.red7.com/robots-txt-googlebot/">Even Robots.txt won&#8217;t keep the googlebot away</a> appeared first on <a href="https://blog.red7.com">Sky&#039;s Blog</a>.</p>
]]></description>
										<content:encoded><![CDATA[<p><img decoding="async" class="alignleft size-full wp-image-3539" style="border: 0px none; margin: 4px 12px;" title="FFF-TUSJ-g" src="/wp-content/uploads/2012/11/FFF-TUSJ-g.png" alt="" width="100" height="100" />Well am I ever surprised! I would have thought that inserting a <strong>robots.txt</strong> file that tells googlebot to &#8220;go away&#8221; would cause it to &#8220;not index the site.&#8221;</p>
<blockquote>
<p>User-agent: *<br /> Disallow: /</p>
</blockquote>
<p>Instead, I discovered that the googlebot may still spot the site and then put up a message saying that the site exists but is not indexed. i.e. the Googlebot still publicizes the existence of the site. It makes Google look like the <em>good guys</em> and us look like the <em>bad guys</em> for putting up a robots.txt. Yay for Google liberating all online information! Boo for us trying to keep our site un-indexed until we’re ready to make it public.<span id="more-3538"></span>I suppose if the site is public, they reason it&#8217;s OK to mention its existence. However, most of us did not intend for any results whatsoever to show up in Google, so having it say &#8220;the site exists but I can&#8217;t index it&#8221; is a big of a revelation! Beware of this if you are creating a pre-production test site &#8212; your site may still show up in Google searches. Instead, turn on some other protection &#8212; like the “Maintenance mode” plug-in for WordPress, so that not only sites but humans can’t use the site. Here&#8217;s kind what the Google result looks like:</p>
<blockquote style="background-color: #ffffff;">
<p><span style="color: #0000ff;">Mork-A-Bork » Uncategorized</span><br /> <strong><span style="color: #339966;">mork-a-bork.info/</span></strong></p>
<div style="text-align: left; color: #222222; margin-top: 5px; margin-bottom: 10px;">A description for this result is not available because of this site&#8217;s robots.txt — learn more</div>
</blockquote>
<p>The post <a href="https://blog.red7.com/robots-txt-googlebot/">Even Robots.txt won&#8217;t keep the googlebot away</a> appeared first on <a href="https://blog.red7.com">Sky&#039;s Blog</a>.</p>
]]></content:encoded>
					
					<wfw:commentRss>https://blog.red7.com/robots-txt-googlebot/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
		<post-id xmlns="com-wordpress:feed-additions:1">3538</post-id>	</item>
	</channel>
</rss>

<!--
Performance optimized by W3 Total Cache. Learn more: https://www.boldgrid.com/w3-total-cache/?utm_source=w3tc&utm_medium=footer_comment&utm_campaign=free_plugin

Page Caching using Disk: Enhanced 

Served from: blog.red7.com @ 2026-04-02 11:22:35 by W3 Total Cache
-->