Does duplicate content really penalize your SEO status? (33 posts)

  • I would like to post a single article on two different blogs.  My question is whether Google really does penalize you for duplicate content?  I know that I can change the copy around a bit but is the duplicate content issue really an issue?

    Thanks in advance for your feedback!

    -greg
    http://www.beforliving.com

  • @gregorybuford No, they do not penalize you for duplicate content. Rather, they just don’t count the second one. It doesn’t get indexed, and it won’t show up in the search results, but they don’t take a negative action on that content.
    If you want to avoid this issue, you need to change the copy at least 25%. I would shoot for at least 50%, in case they ever change it.

  • That’s a great question Greg–I’ve heard that it’s not wise as Google likes original articles, but I don’t know. Still trying to figure out the “sandbox” theory. I have two different websites, so I post a link to the one that I want the traffic to point towards.

    Maybe you can do the same with the back-link to your main site?

    Linda

    http://www.talktherapybiz.com

  • Excellent feedback!  Thank you.

  • Hi Greg,
    I was just looking into this last week for a blog post I was working on.  According to Google Webmasters site, Google states that there are a number of causes of duplicate content.  If they think it’s to manipulate rankings “the ranking of the site may suffer, or the site might be removed entirely from the Google index, in which case it will no longer appear in search results.”  I think it’s good to avoid it altogether.  You can use copyscape.com to help avoid duplicate content.
    Here’s the post to Google Webmasters page on Duplicate Content - http://support.google.com/webmasters/bin/answer.py?hl=en&answer=66359

  • @gregorybuford The answer & link from from @iamconsulting is definitely what you need to check out.  I wouldn’t suggest creating two sites with the same content – either one site will do, or make the content on the second site unique.  

  • Hello @Linda Esposito - what’s the sandbox theory? Interested to hear more about that. Cheers. 

  • @kristi-hines Thanks for the input.  This is definitely helpful!

  • I agree with those above- Google has stated on their blog that they do not penalize for duplicate content, but they tend not to index the duplicates. So I suggest to either re-write it a bit, or better yet write a new post and refer to the original and include a hyperlink to it in the post.

    @thelizweston a sandbox is a way to try things out w/o them being live. PayPal has a sandbox to test buttons to make sure they work correctly before they’re live on your site.

  • How does this differ from article distribution which is supposed to help your ratings, right?

  • How does this differ from article distribution which is supposed to help your ratings, right?

  • @rachelmaskell The main search benefit of article marketing is the inbound links created to your website in your bio at the bottom of each article. Since the Google Panda updates last year, the significance of article marketing has been drastically reduced. People used to use software to rewrite an article 20 times by switching paragraph order or substituting words, then post the results on 100 different article directories, thereby generating 100 links back to the main site (or even 2000 if they posted every version on 100 directories!). Since Panda, Google has cracked down on this practice, reducing the link-juice you get from these mass-distribution sites packed with mainly duplicate content and text ads. Like @erikdeckers said, it’s really only the first instance of the article that gets indexed that will be counted. You’ll need to modify it substantially to post on other article directories, and I would recommend no more than five total, rather than the mass quantities people were doing before. It’s still a good piece of your overall link-building campaign, but it doesn’t have nearly as much impact on its own as it used to, and shouldn’t be the only way you build links.

  • Hiya Folks.

    I can’t add a lot here as you folks seem to have a handle on this. Rewriting an article to have the same content is not an art but can be tricky. The good news is there are multiple software packages out there that do this for you.  

    I’ve downloaded several from cNet (free downloads). I can spin content pretty fast myself, but when I have an article that may pertain to several clients, I have to have help. 

    The best rated at cNet is (if it still is) “Quick Article Spinner” and is here:http://download.cnet.com/Quick-Article-Spinner/9241-18485_4-11886122.html?messageID=10830940

    I’ve downloaded this one, but have not used it as yet.  I plan to give it a spin in the next day or two.

    :)

  • We like to post one place, make sure it gets indexed, and then post anywhere/everywhere else, but make sure to include a link back to the original article towards the top of the page. I vaguely remember Matt Cutts suggesting this was a good technique. 

    @erikdeckers Why did all the scrapper sites get hurt so much last update if Goog is not penalizing for duplicate content? Or is not being indexed the punishment?

  • @robpeck Thanks for sharing. I love to hear different techniques people have adopted since Panda. When you repost your article after the first one is indexed, do you spin it or post an exact duplicate? If you include a link to the original article, do you also include a link back to the website you’re targeting, or are the additional posts intended more to promote the 1st indexed article?

  • @k4satin We typically add that first sentence or two along the lines of:  ”Our colleagues at xyz posted a great article about [KW rich text for link] which we wanted to share. Questions? Ask in the comments section.”and then post verbatim. 

    Since I am generally “the colleagues” I am writing about it makes me feel like I have multiple personality disorder, but seems to work well. I am mixed on article spinning, good technique, but I hate cluttering the internet. 

    Best,   

  • Thanks, @robpeck. So, in this scenario, the original post is on the site you’re trying to promote rather than on an article directory?

  • @k4satin Exactly. I would place the article on the actual site I am promoting, see that get indexed, and then run wild with the “As seen on…” posts elsewhere on the web. 

    This basically came out of the thought, “Why I am giving PRLog (or whoever) backlinks? Shouldn’t they be sending them to me?”

  • @robpeck Right. I can see sending a link or two to something you posted elsewhere to make sure it gets picked up, but certainly not putting much effort into it.

  • @erikdeckers – Yes! So many people talk about “duplicate content penalty” when Google states bluntly on their webmaster help pages that they do NOT penalize for duplicate content.

    BTW, do you have a link from Google on the “no indexing” assertion? The reason I’m asking is because in the past, they would index both, but only return the page with higher relevance. They might well have changed that policy, thus my question. Thanks in advance.

  • @doolin Which link mentions they don’t penalize for duplicate content? How did so many aggregators get hurt on the Panda update if this is the case? 
    http://support.google.com/webmasters/bin/answer.py?hl=en&answer=66359 mentions ranking and indexing penalties, which link are you referring to?

    Thanks,

  • @robpeck – from that same page: 

    Duplicate content on a site is not grounds for action on that site unless it appears that the intent of the duplicate content is to be deceptive and manipulate search engine results. If your site suffers from duplicate content issues, and you don’t follow the advice listed above, we do a good job of choosing a version of the content to show in our search results.”
    I have no idea why people got hurt on Panda updates, won’t even speculate.

  • @doolin Right, but besides for mobile versions, archives, etc… isn’t the purpose of duplicate content “to be deceptive and manipulate search engine results?”

  • @doolin I don’t have that link.

    @robpeck The aggregators got hurt by Panda because a lot of them would just collect stuff and then link back to the original article with a Read More link. They pulled in about 100 words or so, did it dozens of times a day, and Google decided they weren’t creating or offering good content.

    What Panda wants is good writing and design on the websites they refer people to. Doing a 100 word excerpt for the purposes of creating hundreds of backlinks was not “good content,” and so they got dinged.

    I wrote about this last year. http://problogservice.com/2011/08/09/google-wants-you-to-be-a-better-blog-writer/

  • @erikdeckers Right, agree with all. So from the looks of things Google does penalize duplicate content, yeah? Again not considering strange canonicalization issues, just people posting the same text as someone else. 

    Best,  

  • @robpeck Well, to me (and Google), a duplicate “penalty” means they strike a site from their index, or at least bump it down the ranks, because there’s more than one version of it.

    The Google Panda update penalized a lot of aggregators, not because they duplicated a few articles, but because they were providing a lot of very low-quality content. They were giving 100 word excerpts that would point to another 100 word excerpt, which sometimes led to the same 100 word excerpt, before finally going to the original. It provided no value and was being done strictly for the backlinks.

    Google left the original page alone and dropped all the 100 word sites. Now, the original site may have been lowered because of the changes to the algorithm, but they didn’t go through and hammer on all the original sites.

  • @robpeck – I’m agreeing with Erik on this one, because Google has historically been reasonably transparent about when they do specifically penalize sites. 

    There is definitely a certain amount of duplicate content posted which is deceptive and attempts to manipulate search engine rankings. 

    But not all of it.

    Consider, for example, stories distributed by AP. It’s duplicate content, but it may be shown on many websites, depending on the target audience of the site. 

    My hunch is that deception and manipulation have specific algorithmic signals, while syndication and other forms of duplication have different signals.  

    On the other hand, consider the mathematics required to find and assess duplicate content. The naive case is checking every web page against every other web page. The computational cost of that is roughly “Order(heat-death-of-the-universe)” time scale. (This same limitation applies to LSI, except that LSI is a *lot* more expensive to compute.)

  • @k4satin thank you for the clarification. Seems like there is a lot of gray area altogether too. I’ve started following the feed, any recommendations for websites to learn more?

  • @rachelmaskell I’m glad my comments were helpful. There’s always a lot of gray area in SEO b/c Google is not particularly transparent about their algorithms. Understandably so, since they are trying to assess natural indications of good content and prevent people from gaming the system. Matt Cutts’ blog (http://www.mattcutts.com/blog/ ) is always good to follow to see what they do reveal, but much of the true data comes from SEOs trying things out and testing, testing, testing. Google also shares on the Webmasters Central Blog.

    For info outside what Google shares, I don’t know that I could give you just one source. You get the most value from following several people and considering their analysis together with your own testing. Some reliable sources to follow include SEOmoz, Search Engine Journal, Jill Whalen at High Rankings, Nick Stamoulis…

    How about the other SEO folks here. Who else do you recommend following?

  • @k4satin Your list nailed it. Only ones I would think to add are http://searchengineland.com/ and http://www.seobook.com/ 

    Matt Cutts blog used to so helpful; it took a turn for the strange the past 6 months or so though. More about personal growth than SEO. 

    Best, 

  • Those are good ones too @robpeck. If anyone wants to know more about Panda, I found this on Search Engine Land. The infographic is just a timeline, but at the bottom of the post they list all the articles they’ve written about Panda over the past year.

    http://searchengineland.com/google-panda-update-112805

  • Orlando Social Media
    Normal 0 false false false EN-US X-NONE X-NONE /* Style Definitions */ table.MsoNormalTable {mso-style-name:”Table Normal”; mso-tstyle-rowband-size:0; mso-tstyle-colband-size:0; mso-style-noshow:yes; mso-style-priority:99; mso-style-parent:”"; mso-padding-alt:0in 5.4pt 0in 5.4pt; mso-para-margin-top:0in; mso-para-margin-right:0in; mso-para-margin-bottom:10.0pt; mso-para-margin-left:0in; line-height:115%; mso-pagination:widow-orphan; font-size:11.0pt; font-family:”Calibri”,”sans-serif”; mso-ascii-font-family:Calibri; mso-ascii-theme-font:minor-latin; mso-hansi-font-family:Calibri; mso-hansi-theme-font:minor-latin; mso-bidi-font-family:”Times New Roman”; mso-bidi-theme-font:minor-bidi;}By all indications, Google intends to weed out low value results from the search results. Getting your content quality up now offers one of the best opportunities to protect your site from future changes. Structure the site so that the user is able to locate the information and/or content they are looking for; with the least amount of hassle, as possible.
    hit me up on Twitter!
    Blufishocean
    Normal 0 false false false EN-US X-NONE X-NONE /* Style Definitions */ table.MsoNormalTable {mso-style-name:”Table Normal”; mso-tstyle-rowband-size:0; mso-tstyle-colband-size:0; mso-style-noshow:yes; mso-style-priority:99; mso-style-parent:”"; mso-padding-alt:0in 5.4pt 0in 5.4pt; mso-para-margin-top:0in; mso-para-margin-right:0in; mso-para-margin-bottom:10.0pt; mso-para-margin-left:0in; line-height:115%; mso-pagination:widow-orphan; font-size:11.0pt; font-family:”Calibri”,”sans-serif”; mso-ascii-font-family:Calibri; mso-ascii-theme-font:minor-latin; mso-hansi-font-family:Calibri; mso-hansi-theme-font:minor-latin; mso-bidi-font-family:”Times New Roman”; mso-bidi-theme-font:minor-bidi;}


  • What a discussion. I suppose this topic attracts a lot of responses in every forum. I picked up a few useful things here.

  • great discussion, thanks for asking this question @gregorybuford


Add your voice to the discussion

Existing members: . If you do not have a SME account, .

 
 
Check out the Social Media Marketing Podcast!
Get your ad placed here!

Networking Clubs Leaderboard

Avatar ImageE
Ann at  greenoakAnn
Kapil MudholkarKapil
Avatar ImageChris
Avatar ImageJudith
Avatar ImageLydia
Avatar ImageAndré
Avatar ImageJameson
KMediaIrelandKMediaIrel
Avatar ImageAlexandra
Learn more about the Networking Clubs

Recently Active Members

Profile picture of
aeronaftis
Harold Williams
Masha Rbandara Perera
Profile picture of
Johnny Hunt
Profile picture of
Profile picture of
Profile picture of
Ann at  greenoak
Profile picture of
Robin Yearsley
Mark Matthews
Profile picture of
Rachel Agheyisi