Robots.txt and Google
10-27-2007, 08:31 PM,
Post: #1
Robots.txt and Google
I've got a poser for ya: I'm blocking all dynamic content on my site via my robots.txt file. I've verified this with Google's robots.txt analyzer (through Google's sitemaps). Now try this search: http://www.google.com/search?num=100&hl=en&safe=off&q=%22Cyrus+Rowlett+Smith%22+1941
In the SERP, go down 10-15 spots and you'll find our page (do a text search for dewittco if you want). Now, note the two links. The top link is the dynamic page and under that is the static page (the page that I WANT indexed). Can someone explain this to me?
(Note that the top link has the "www" subdomain, while the lower one doesn't. Not sure what that means, either.)
10-27-2007, 09:39 PM,
Post: #2
Re: Robots.txt and Google
Quote: Now, note the two links. The top link is the dynamic page and under that is the static page (the page that I WANT indexed).
Maybe you should block the lower-ranking static pages instead.
10-28-2007, 05:56 PM,
Post: #3
Re: Robots.txt and Google
We've considered eliminating the static pages altogether, but Google (et al.) seem to crawl - and more importantly, index - static pages much, much faster. I'm just a bit baffled as to why Google is indexing blocked pages.
10-28-2007, 09:01 PM,
Post: #4
Re: Robots.txt and Google
They're indexing the dynamic page but not caching it.
Try something like this:
Code:
User-agent: Googlebot
or
Code:
User-agent: Googlebot
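For anyone wondering how asterisk patterns in a robots.txt rule behave, here is a minimal Python sketch of Google-style wildcard matching: `*` stands for any run of characters, a trailing `$` anchors the end of the URL, and everything else is matched literally from the start of the path. The `/*?` and `/*.php$` patterns and the sample paths are assumptions made up for illustration, not the rules posted in this thread. The sketch also ignores Allow lines and rule precedence, so treat it as a way to reason about patterns rather than as Googlebot's actual logic.

```python
import re
from typing import Iterable


def rule_to_regex(pattern: str) -> "re.Pattern[str]":
    """Turn one robots.txt path pattern into a compiled regex.

    '*' matches any run of characters, a trailing '$' anchors the end,
    and all other characters are literal.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape the literal pieces and join them with '.*' where '*' stood.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_blocked(path: str, disallow_patterns: Iterable[str]) -> bool:
    """True if any Disallow pattern matches from the start of `path`."""
    return any(rule_to_regex(p).match(path) for p in disallow_patterns)


if __name__ == "__main__":
    # Hypothetical patterns and paths, for illustration only.
    for pattern, path in [
        ("/*?", "/product.php?productid=123"),
        ("/*?", "/widget.html"),
        ("/*.php$", "/product.php?productid=123"),
    ]:
        print(pattern, path, is_blocked(path, [pattern]))
```

Under these assumptions, a pattern like `/*?` covers every URL that carries a query string while leaving plain static URLs alone.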
10-28-2007, 10:17 PM,
Post: #5
Re: Robots.txt and Google
You could also try it without the asterisks:
Code:
User-agent: Googlebot
You can try adding the query string (the ?xxxid= part) to the other pages you want blocked in your robots.txt file. Googlebot is probably seeing product.php and product.php?productid=xxx as 2 separate pages, which is why your block isn't working.
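One way to check how a rule treats product.php with and without its query string is to run both URLs through a parser. The sketch below uses Python's standard `urllib.robotparser`, which does plain prefix matching and is not Googlebot's own implementation, so the results are only indicative. The rule set and the `www.example.com` host are assumptions, since the real robots.txt isn't shown in this thread.

```python
from urllib.robotparser import RobotFileParser


def can_crawl(rules, user_agent, url):
    """Return True if `rules` (a list of robots.txt lines) let
    `user_agent` fetch `url`, according to Python's prefix matcher."""
    parser = RobotFileParser()
    parser.parse(rules)
    return parser.can_fetch(user_agent, url)


# Hypothetical rule set; the actual file is not shown in the thread.
RULES = [
    "User-agent: Googlebot",
    "Disallow: /product.php",
]

if __name__ == "__main__":
    for url in (
        "http://www.example.com/product.php",
        "http://www.example.com/product.php?productid=123",
        "http://www.example.com/some-product.html",
    ):
        print(url, can_crawl(RULES, "Googlebot", url))
```

With this parser, a bare `Disallow: /product.php` already covers the query-string form because the rule is compared as a prefix of the URL path plus query. If the live rule is written differently, the outcome may differ, so it is worth testing the exact lines from your own file.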
"Well, Jay was so giddy that someone named Jay was involved with this site we posted our first non-eBay listing in 3 years here at Lunarbid (we tried two items at Yahoo once upon a time, they bombed)" -Marie posting in a LunarBid thread at OTWA in 2005 wins the award for 'most moronic reason ever given for choosing a venue"
"thanks twat u must have nothing better 2 do. do u talk to all your members like that. will not be recomending your site. best way to put it is TULIPTOOLS.COM IS REALLY SHIT. DONT JOIN." -pubescent owner of rinky dink off2auction.com in 2011
10-28-2007, 10:35 PM,
Post: #6
Re: Robots.txt and Google
OK... thanks Regic & BBH - I'll be trying that... actually, I don't see the harm in adding all three suggestions, do you?
Aside from this, here's another question (I briefly discussed this with the boss earlier): What if we do away with all of our products' static pages and allow crawling of the dynamic pages - any idea how big of a hit we'll take in the SERPs? I don't dare do this so close to the holidays, but I'm thinking of doing it early next year. Please opine!
10-29-2007, 03:41 PM,
Post: #7
Re: Robots.txt and Google
Quote: What if we do away with all of our products' static pages and allow crawling of the dynamic pages - any idea how big of a hit we'll take in the SERPs?
The static pages aren't that old. I think you'd recover quickly from any temporary drop. I don't know why X-cart doesn't just use mod_rewrite instead of using static pages (which were a popular feature of shopping cart and directory scripts in the 90's before Apache added mod_rewrite).
10-29-2007, 09:23 PM,
Post: #8
Re: Robots.txt and Google
Quote: static pages (which were a popular feature of shopping cart and directory scripts in the 90's
Search engines have gotten much better at indexing dynamic content over the years. Name me a site that doesn't use dynamic content today.