Thursday, October 29, 2009 at 5:50 PM
Webmaster Level: IntermediateGoogle uses numerous sources to find new webpages, from links we find on the web to submitted URLs. We aim to discover new pages quickly so that users can find new content in Google search results soon after they go live. We recently launched a feature that uses RSS and Atom feeds for the discovery of new webpages.
RSS/Atom feeds have been very popular in recent years as a mechanism for content publication. They allow readers to check for new content from publishers. Using feeds for discovery allows us to get these new pages into our index more quickly than traditional crawling methods. We may use many potential sources to access updates from feeds including Reader, notification services, or direct crawls of feeds. Going forward, we might also explore mechanisms such as PubSubHubbub to identify updated items.
In order for us to use your RSS/Atom feeds for discovery, it's important that crawling these files is not disallowed by your robots.txt. To find out if Googlebot can crawl your feeds and find your pages as fast as possible, test your feed URLs with the robots.txt tester in Google Webmaster Tools.


16 comments:
This is could be the first. I've been using RSS/Atom Feeds since I make my blog. The feature is easy to use more delight. Thanks
Interesting, RSS is certainly an easy digestible format. I have 2 things that I find very interesting here and perhaps you guys can answer this.
1: What does this mean for sites that have RSS, will the web pages themselves still be fully crawled and how important will the RSS data be compared to the document itself?
2: What fields of the RSS feeds will Google read and how do these relate to the meta description and title tags for the web pages themselves?
I'll gonna try it...
This is so weird - I'd been assuming all along y'all would consult RSS/Atom feeds for new content. It's such an obvious thing to do and that's what they were invented for. What took you so long? (wink)
So now that you've incentivized the link spammers to redouble their efforts to plague blogs with crappy comments, will the other hand at Google give Blogspot users the ability to update the robots.txt file so that we CAN disallow the atom/RSS feeds?
Do you think Google will ever start indexing the rel="tag" ? It would be cool to search your site or blog by tag.
http://microformats.org/wiki/rel-tag
Brilliant.
A smart practice websites can partake in, upload content to local super hubs for easy indexing (pubsubhubbub, RSScloud)
Nice infomation about feed, but i am still getting problem to index my blog http://vegfruitcarving.blogspot.com
I don't get this. I assumed that Google has access to Feedburner, which knows all my updates as they happen.
OIC, not all blog owners know about Feedburner.
Still, this is a weird bit of news.
feedburner works great.. and i have done feed submissions before and they seems to have triggered something that got my blog post/pages indexed within 24 hours..
amazingly fast
Anyone know if GoogleBot supports following links to other feed pages by grokking Atom Feed Paging and Archiving?
RSS/ATOM feeds are generally details present on website, how can be this useful to discover new URLs
Thanx for the valuable information. RSS is certainly an easy digestible format. I have 2 things that I find very interesting here.. keep posting. Will be visiting back soon.
i sure learned a lot from this article.. thanks.
Related to RSS indexing by Google I have a suggestion :
Under "show options" in Google search there should be an option to retrieve only links to RSS feeds for the particular searched term. This will greatly help in finding RSS feeds related to topic of interest and help in filtering/finding useful content.
Not sure if this is a good idea... google has indexed my rss feeds, and for some reason googlebot thought that some html tags used for formatting the feed are text, so when I look at webmaster tools, the keyword with most significance is 'strong'... weird isn't it?
Post a Comment