Google Webmaster Central Blog - Official news on crawling and indexing sites for the Google index

Traffic drops and site architecture issues

Sunday, June 28, 2009 at 3:39 AM

Webmaster Level: Intermediate.

We hear lots of questions about site architecture issues and traffic drops, so it was a pleasure to talk about it in greater detail at SMX London and I'd like to highlight some key concepts from my presentation here. First off, let's gain a better understanding of drops in traffic, and then we'll take a look at site design and architecture issues.

Understanding drops in traffic

As you know, fluctuations in search results happen all the time; the web is constantly evolving and so is our index. Improvements in our ability to understand our users' interests and queries also often lead to differences in how our algorithms select and rank pages. We realize, however, that such changes might be confusing and sometimes foster misconceptions, so we'd like to address a couple of these myths head-on.

Myth number 1: Duplicate content causes drops in traffic!
Webmasters often wonder if the duplicates on their site can have a negative effect on their site's traffic. As mentioned in our guidelines, unless this duplication is intended to manipulate Google and/or users, the duplication is not a violation of our Webmaster Guidelines. The second part of my presentation illustrates in greater detail how to deal with duplicate content using canonicalization.

Myth number 2: Affiliate programs cause drops in traffic!
Original and compelling content is crucial for a good user experience. If your website participates in affiliate programs, it's essential to consider whether the same content is available in many other places on the web. Affiliate sites with little or no original and compelling content are not likely to rank well in Google search results, but including affiliate links within the context of original and compelling content isn't in itself the sort of thing that leads to traffic drops.

Having reviewed a few of the most common concerns, I'd like to highlight two important sections of the presentation. The first illustrates how malicious attacks -- such as an injection of hidden text and links -- might cause your site to be removed from Google's search results. On a happier note, it also covers how you can use the Google cache and Webmaster Tools to identify this issue. On a related note, if we've found a violation of the Webmaster Guidelines such as the use of hidden text or the presence of malware on your site, you will typically find a note regarding this in your Webmaster Tools Message center.
You may also find your site's traffic decreased if your users are being redirected to another site...for example, due to a hacker-applied server- or page-level redirection triggered by referrals from search engines. A similar scenario -- but with different results -- is the case in which a hacker has instituted a redirection for crawlers only. While this will cause no immediate drop in traffic since users and their visits are not affected, it might lead to a decrease in pages indexed over time.




Site design and architecture issues
Now that we've seen how malicious changes might affect your site and its traffic, let's examine some design and architecture issues. Specifically, you want to ensure that your site is able to be both effectively crawled and indexed, which is the prerequisite to being shown in our search results. What should you consider?

  • First off, check that your robots.txt file has the correct status code and is not returning an error.
  • Keep in mind some best practices when moving to a new site and the new "Change of address" feature recently added to Webmaster Tools.
  • Review the settings of the robots.txt file to make sure no pages -- particularly those rewritten and/or dynamic -- are blocked inappropriately.
  • Finally, make good use of the rel="canonical" attribute to reduce the indexing of duplicate content on your domain. The example in the presentation shows how using this attribute helps Google understand that a duplicate can be clustered with the canonical and that the original, or canonical, page should be indexed.


In conclusion, remember that fluctuations in search results are normal but there are steps that you can take to avoid malicious attacks or design and architecture issues that might cause your site to disappear or fluctuate unpredictably in search results. Start by learning more about attacks by hackers and spammers, make sure everything is running properly at crawling and indexing level by double-checking the HTML suggestions in Webmaster Tools, and finally, test your robots.txt file in case you are accidentally blocking Googlebot. And don't forget about those "robots.txt unreachable" errors!

The comments you read here belong only to the person who posted them. We do, however, reserve the right to remove off-topic comments.

23 comments:

Brandon said...

When viewing in google reader this post redirects me to a google docs login page.... very very annoying

Derek said...

Same thing is happening to me in Reader. Using Chrome.

Greta said...

Please address the issue of CMS blog-style pages...Full or partial duplicate content that appears on one aggregate, feature page -- such as front page -- but then also is a separate (full) page onto itself.

I don't believe that I have seen a solution for this, and canonical isn't appropriate as the featuring page(s) rotate content regularly.

Tyme

Brad said...

Greta,

Why isn't canonical appropriate for that? All of those articles featured on the homepage/featurepage have permanent addresses don't they?

Brad said...

Oops I misread your post, now I understand what you are saying.

Have you seen any site penalized for having snippets of an article on the homepage though - I don't think I have.

Jennifer said...

Hi,
My website www.video-to-flash.com has been moved to www.realwo.cn. And when I enter
Site: www.video-to-flash.com the first result turns out to be a Chinese blog, and the URL is still www.video-to-flash.com. Could you please help me out by withdraw the change of address?

Here is the screenshot of the problem I am reporting.
http://www.video-to-flash.com/images/change_address.gif

Please help! Thanks a million.

Webblogs said...

Google fix this problem all ready click the refresh button when you try to see it.

MIKE said...

Does Google 'drop' individual pages from its indexing? (Or can a single page go to the Supplemental Index after having ranked in the top 100?). The home page on my site (www.buzzwords.ltd.uk/) doesn't show in at least the first thousand pages for the keyword 'copywriting', even though the page is optimised for that keyword. Also, Buzzwords is a copywriting agency with references to 'copywriting' throughout the site. Help please! Mike Beeson

DataPlus - Custom Data Services said...

I am a scardy cat about moving my site. I set up a new domain and have kept the old - I link to the old from the new. I was hoping my comfort level would increase. I have PR 3 and decent traffic for a service site. Any problem doing it this way?

karthi said...

The information’s given by you really good. I got good knowledge from your information’s. Thanks

To view more details in web development, visit http://www.netultimate.com/SEO.php

Jorijn said...

There's a bug in this page (embedded docs) which redirects me to a google.com apps site to login when viewing in Google Reader... Took me 2 days to find out it was this post :P!

Ellithy said...

I diagree with "unless this duplication is intended to manipulate Google and/or users, the duplication is not a violation of our Webmaster Guidelines."

On my Blog I recieve about 20 comments daily, each comment gets a link and is considered duplicate to the post, that is because the showcomments= urls generated by blogger for each comment which is exactly the same content as the post.
I use the Rel Canonical for this blog.
Really and I mean by using analytics and blog stats, SERPs decrease when duplicate urls increase and of course blogger does not "intend to manipulate Google"

Sanjeev Saikia said...

Hi,
I have a blog: www.sanjeevsaikiaart.blogspot.com, until recently whenever there was a search in google, my blog's name and the relevant post title appeared together and the site came up high when the search was relevant.

Since a few days back the title of the post doesn't appear in the search result, only the blog title and maybe a sub title or a tag of the post.

Why has this happened and how can I correct it?

Regards,

Sanjeev
sanjeev.twitters@gmail(dot)com

The Chimp said...

" Myth number 1: Duplicate content causes drops in traffic!"

I have to say I disagree with this point; while DC does not directly cause a drop in traffic, it does cause drops in rankings (as everyone will have seen). This in turn causes a drop in traffic.

As such it's a bit misleading to suggust that this is a myth.

Oh Cheri! said...

Google appears to be treating a critical word on my site as a "stop word". I sell plus size lingerie and the #2 keyword on my site (per my webmaster tools) is 'size'. But 'plus' doesn't appear at all even though these two words go together 99% of the time. In addition, my Sitelink appears as Size Lingerie, when it should say Plus Size Lingerie. This is seriously hurting my ability to reach potential customers. What can I do to make Googlebot see Plus Size as a phrase? My URL is www.OhCheri.com

Alex Smith said...

The issue that that I have over 75k pages on my site (www.ShoeCartel.com), but I have been battling with my Google indexing. I have Google indexing the site each day, but I have pages constantly falling out of the index, which causes me tremendous problems with my traffic for long tail terms that I usually rank well for. Why this is happening? what I can do to get that number back up? I was at 40k at one point, but I am down to just 1,260 pages now. At the beginning of the week, I was at 16,600 pages in Google index.

Oh Cheri! said...

My big drop in traffic happened because Google Base Scheduler isn't working...when are you going to fix this? I see lots of people discussing this on the forums but no response from Google.

Kenneth Udut (free.naplesplus.us - Naples News, Info, Jobs) said...

I have a bit of confusion I'm trying to figure out. I have a "short link" version of the url of most pages on my site. It consists of a go.php/nickname which is shorter than the full name, which includes an index number.

Example:

http://free.naplesplus.us/articles/view.php/50854/jandmcleaningservicesofnaples is the full url and

http://free.naplesplus.us/go.php/jandmcleaningservicesofnaples is the shorter url. Not much shorter, but still shorter.

Surrounding the "shortcut" url I have rel="self alternate shorter shorturl shortlink" in the hopes that engines will pick up that this is also a good url to reach this page at, just shorter.

The go.php makes a 301 redirect, which is proper SEO.

My question is: Will this result in duplicate content? and will a rel=canonical help in indexing in this scenerio?

Ken, Naples, FL

apothema.gr said...

Hello to all,
On the 3d of July our website was under DDOS attack and the attack lasted for about 6 days on & off. In order to maintain our uptime we cut-off all access from foreign countries(we are located in Greece).Unfortunately by restricting acces only to Greek ISP's we also cut-off Google bot...! One other action we took was to make a "dummy" home page in order to reserve server resourses since the home page would not need to be reloaded by each hit but would refresh every 5 minutes. Since Saturday the 18th we noticed a huge drop of our organic hits by approx. 80%. My question is what caused the drop? The inability of Google to bot aour site due to the restrictions or the presence of the dummy page and why did Google take more than 2 weeks to degrade our organic results?
Thank you.

JustBen said...

Regarding Affiliate programs.

My site ( http://www.cellup.com ) specializes in Cell phone accessories such as batteries chargers and headsets. We have approximately 500,000 pages of dynamically generated product and compatibility information.

I belong to two different affiliate programs that are not my primary source of revenue but are substantial. One with about 2,500 pages generated from an xml feed with phones and calling plans and another with around 400,000 ring tones games etc.

I believe that these products are complementary to my core business but if the inclusion is hurting my ranking and therefore sales of the cell phone accessories I would pull them off my site completely.

Does anyone have any idea how Google would view all of these pages? They are relevant and complimentary but there are several thousand pages.

I've currently pulled the sections and pages off of my site for fear of getting blacklisted and only offer a single link to each of the affiliated sites at the top of each page.

Since doing this my affiliate sales have gone to near zero from making around $2,000 more monthly.

Mahmud said...

thanks for your post about traffic drops and site architecture issues. I got very helpful information from there.

tobiascooks! said...

Thank you for these very useful tips. I have checked my website thoroughly I am am facing a drastic drop in traffic. Non of the issues seem to apply though. I have raised the issue at the support forum but have no answer for days. I would be very thankful if anyone can give me a feedback on why my blogs traffic has dropped so drastically.

http://www.google.com/support/forum/p/Webmasters/thread?fid=3aa1923ac8c2c7ac000493b94fd5dd44&hl=en

Cheers Tobias

Google Webmaster Central said...

Hi everyone,

Since over a year has passed since we published this post, we're closing the comments to help us focus on the work ahead. If you still have a question or comment you'd like to discuss, free to visit and/or post your topic in our Webmaster Central Help Forum.

Thanks and take care,
The Webmaster Central Team