The Google Duplicate Content Penalty: the Truth


Here’s an interesting article by Peter Nisbet of Article-Writing.com on Google’s duplicate content penalty (or lack thereof). Go figure … :>

======================================================

The truth of the Google duplicate content penalty is quite simply that there is none! If that confuses you, then you have been reading too many misinformed forums or blogs where people get stuck on some popular term that they have no idea what it means, and then profess to be experts.

The only experts on the Google duplicate content penalty, and the only people who are qualified to define it, are Google, and in Google’s own words “There is no such thing as a duplicate content penalty“. This comes directly from Google’s Webmaster Central Blog.

That should be the end of this article, at precisely 96 words excluding title as I define my word count. But it is not. Why? Because even though this blog is operated by Google, and even though much the same has been stated by Matt Cutts, Google’s main software engineer, and other Google experts, people still argue and complain about the Google ‘duplicate content penalty’.

So here is the truth: you might ask who am I to know the truth, but I read all the Google blogs and their official statements, and in applying what I learn, I achieve excellent results for my web pages on Google search engine listings: and those of Yahoo, MSN and Bing. So I am coming from a sound base that my results can prove.

As a professional article writer whose customers trust to get them the best results from the articles I write, I have to be very aware of the policies and the way the algorithms work of each of the major search engines, and so I am as qualified as anybody to comment on myths such as this.

The Truth of the Google Duplicate Content Penalty

There is no duplicate content penalty. Google’s major search engine function is to provide a customer the best possible results for a search, based upon the search term (keywords) that the customer has used in the Google search box.

Google’s customers are not:

1. You, who use it to get your web pages listed.

2. Adwords advertisers that use Adwords to advertise their products.

3. Corporations or individuals that use it to have their web pages listed.

4. Internet marketers who recommend others to use Google for advertising or searching.

Google’s customers are those seeking information, whether that is to solve a problem, where to purchase a product at the cheapest price, find a sports result or to get directions to a specific location. Everybody that uses Google uses a search term to find some information that they need. That search term is what you and I refer to as a keyword.

If Google detects several web pages offering exactly the same content, its algorithms will select that which best offers the information required and list that. It might also list one or two other pages offering exactly the same content if there are good reasons for it doing so (e.g. more links to other relevant websites, more other relevant pages on the domain, and so on).

So, not all duplicate content pages will be refused a listing. If these duplicates are articles, then the algorithms that the spiders carry on their backs will take the links from these articles into consideration, the authority of the directory on which it is published, and other factors, before deciding which should be listed. It is wrong to believe that this decision has a chronological factor, but, if you include a link in your article Resource section to your web page that contains the same article, then your page is liable to be listed above the others, partially because of a greater number of links back to it from the other copies, and partially because your entire site is liable to be more relevant than these others to information being sought by Google’s customer.

This is not because yours was created first, but because it better meets Google’s criterion for authoritative back-links. However, if the rest of your website is not equally authoritative, your page might be listed behind another with the same content or even not listed at all.

All of this is designed by Google so that its customer is offered the most relevant range of results to the keywords they used. That is what Google is for, and is its ultimate objective. Google will not penalize any individual or any website for publishing what you refer to as ‘duplicate content’, and it will take your version into consideration for publication just as any other version.

What counts in the long run is which version Google’s algorithms believe to be most likely to provide the best possible information to the person seeking it, and if that means not publishing a whole host of duplicate information, then that is only fair, isn’t it? If you used Google to find some information, you wouldn’t want to find page after page saying exactly the same thing, would you?

No, and neither does Google. A Google listing comes from its indexing of billions of web pages that contain the keywords used by the searcher: both in relation to the entire phrase and to the individual words used in the search term. If you want your copy to be different, make some minor changes and perhaps change the form of the keywords, but most importantly, change the title and the introductory paragraph to which the crawlers will take special notice.

You then have a better chance of your version being listed along with some of the others, but remember: the next time you use the term ‘duplicate content’ you are using a term that does not exist in Google’s vocabulary for any reason than to deny its existence. The Google Duplicate Content Penalty does not exist: the truth!

About The Author
For more information on the mythical duplicate content penalty visit www.article-services.com where Peter Nisbet will also explain how to earn money using article marketing.

Subscribe to Building Mailing Lists

Reblog this post [with Zemanta]

SocialTwist Tell-a-Friend

About rel="canonical"


 
What is a canonical page? Why specify a canonical page?
A canonical page is the preferred version of a set of pages with highly similar content.

 
 

 
It’s common for a site to have several pages listing the same set of products. For example, one page might display products sorted in alphabetical order, while other pages display the same products listed by price or by rating. For example:

http://www.example.com/product.php?item=swedish-fish&trackingid=1234567&sort=alpha&sessionid=5678asfasdfasfd

http://www.example.com/product.php?item=swedish-fish&trackingid=1234567&sort=price&sessionid=5678asfasdfasfd

 

If Google knows that these pages have the same content, we may index only one version for our search results. Our algorithms select the page we think best answers the user’s query. Now, however, users can specify a canonical page to search engines by adding a <link> element with the attribute rel=”canonical” to the <head> section of the non-canonical version of the page. Adding this link and attribute lets site owners identify sets of identical content and suggest to Google: “Of all these pages with identical content, this page is the most useful. Please prioritize it in search results.”

How do I specify a canonical page?
To specify a canonical link to the page http://www.example.com/product.php?item=swedish-fish, create a <link> element as follows:

<link rel="canonical" href="http://www.example.com/product.php?item=swedish-fish"/>

 
Copy this link into the <head> section of all non-canonical versions of the page, such as http://www.example.com/product.php?item=swedish-fish&sort=price.
If you publish content on both http://www.example.com/product.php?item=swedish-fish and https://www.example.com/product.php?item=swedish-fish, you can specify the canonical version of the page. Create the <link> element:

<link rel="canonical" href="http://www.example.com/product.php?item=swedish-fish"/>

 

Add this link to the <head> section of https://www.example.comproduct.php?item=swedish-fish.

Is rel=”canonical” a suggestion or a directive?
This new option lets site owners suggest the version of a page that Google should treat as canonical. Google will take this into account, in conjunction with other signals, when determining which URL sets contain identical content, and calculating the most relevant of these pages to display in search results.

Can the link be relative or absolute?
The rel=”canonical” attribute can be used with relative or absolute links, but we recommend using absolute links to minimize potential confusion or difficulties. If your document specifies a base link, any relative links will be relative to that base link.

Must the content on a set of pages be similar to the content on the canonical version?
Yes. The rel=”canonical” attribute should be used only to specify the preferred version of many pages with identical content (although minor differences, such as sort order, are okay).

For instance, if a site has a set of pages for the same model of dance shoe, each varying only by the color of the shoe pictured, it may make sense to set the page highlighting the most popular color as the canonical version so that Google may be more likely to show that page in search results. However, rel=”canonical” would not be appropriate if that same site simply wanted a gel insole page to rank higher than the shoe page.

What happens if rel=”canonical” points to a non-existent page? Or if more than one page in a set is specified as the canonical version?
We’ll do our best to algorithmically determine an appropriate canonical page, just as we’ve done in the past.

Can Google follow a chain of rel=”canonical” designations?
Yes, to some extent, but to ensure optimal canonicalization, we strongly recommend that you update links to point to a single canonical page.

Can rel=”canonical” be used to suggest a canonical url on a completely different domain?
No. To migrate to a completely different domain, permanent (301) redirects are more appropriate. Google currently will take canonicalization suggestions into account across subdomains (or within a domain), but not across domains. So site owners can specify a canonical page on www.example.com from a set of pages on example.com or help.example.com, but not on example-widgets.com.

 

About This Article:
This article is from Google Webmaster Tools / site management – “About rel=”canonical” – Webmasters/Site owners Help” – Explained by Matt Cutts

 

|   Subscribe to Building Mailing Lists   |   Digg

Reblog this post [with Zemanta]
SocialTwist Tell-a-Friend