{"id":4236,"date":"2016-10-31T18:40:33","date_gmt":"2016-10-31T16:40:33","guid":{"rendered":"http:\/\/xtremefreelance.com\/?p=4236"},"modified":"2016-10-29T00:43:58","modified_gmt":"2016-10-28T21:43:58","slug":"duplicate-content-google-webmaster-central-office-hours-hangout","status":"publish","type":"post","link":"https:\/\/xtremefreelance.com\/duplicate-content-google-webmaster-central-office-hours-hangout\/","title":{"rendered":"Duplicate content issues and more at Google Webmaster Central office-hours hangout"},"content":{"rendered":"

[vc_row][vc_column][vc_video link=”https:\/\/youtu.be\/c4wQnwVtBZo” align=”center”][vc_separator border_width=”5″][vc_column_text]<\/p>\n

Duplicate content issues<\/h2>\n
\n\n

Hot to check for duplicate content<\/h3>\n

\"duplicate

What is duplicate content<\/strong><\/h3>\n

Duplicate content is any text repeated in more than one web page, either on your site or outside. This is what happens when a web page appears with different URLs. But also when a spammer copy text from your page and modify it and post it on his website.<\/p>\n

At first glance it may appear that duplicate content is not so important, but the truth is that it is a very serious problem. Google search engine users expect different results, not the same results repeated. So to avoid this, the search filters prevents the occurrence of duplicate content.<\/p>\n

The consequences of duplicate content<\/strong><\/h3>\n

Now that you know why it is so important to avoid duplicate content, you should know the problems that may arise in your site. Some of the most important are:<\/p>\n

Incorrect page<\/strong> – different pages with the same content to let search engine to make the best choice. This is not a good choice because the browser can choose a version that we do not want.
\nPoor visibility<\/strong> – As a result of this search engine can show a version with worse optimization and therefore a lower rank.
\nIndexing issues<\/strong> – indexing your pages may be affected due to the fact that search engine search the duplicate pages instead of pages that really are important. In many cases duplicate content gets to be a significant portion of indexed pages.
\nLost links –<\/strong> duplicate pages can get links and \u00a0link power will be diluted.
\nMoreover, you should know that Google rejects duplicate content, not penalize; it just filters it out\u00a0and this is punishment enough \u00a0to consider avoiding it.<\/p>\n

Causes duplicate content<\/h3>\n

The main source of your site duplicate content is the site itself and does not matter how well you’ve optimized in terms of SEO. As you will see there are plenty of reasons why you can have a lot of duplicates without knowing.<\/p>\n

These are the main reasons:<\/strong><\/p>\n

Noncanonical links<\/strong>– Your website can work with as a subdomain that begins with the prefix “www” \u00a0while the main domain does not begin with this prefix. Canonical version is the good one good and if is not set correctly your content\u00a0appears in both variants thus generating duplicates.
\nHTTPS pages<\/strong> – similar to what happens to the canonical urls above, if using SSL encryption\u00a0on a site, you can have an exact copy of your site on the secure (https) and one non-secure (http)
\nDynamic content<\/strong> – There are sites that assign url parameters to control the content. As with session IDs, search engines interpret this as a duplicate.
\nArchives<\/strong> – A typical problem is that blogs can show the same content on different pages such as categories and tags.
\nPaging<\/strong> – Any site that uses paging may have this problem, especially if you share a page title and description.<\/p>\n

Off-site duplicate content:<\/h3>\n

Syndication<\/strong> – used to send your content to other websites to generate traffic, such as via RSS. The problem can occur when these sites publish a full copy of the content, instead of a fragment.
\nLocation<\/strong> – To target\u00a0your content to several countries it could be used the same content (or almost) in several domains such as .com and localized domains
\nScraping<\/strong> – Scrapers are people who are using a software copy of some or all your content and publish it in another sites.
\nPlagiarism<\/strong> – Anyone who copy some text and publish it on his website as their own. Sometimes it happens intentionally.<\/p>\n

How can we detect duplicate content<\/h3>\n

Google identifies duplicate content primarily through pages with titles, descriptions, identical or very similar content . Therefore, if you want to find duplicate content on your site should start here.<\/p>\n

Here are\u00a0you the most effective methods to find duplicate content:<\/p>\n

Google Webmaster Tools<\/strong> – If you registered the site in Google Webmaster Tools, this is definitely the best place to start. Access the \u00a0your site in Search -> Enhancements -> HTML and pay attention to duplicate title tags and meta descriptions. This instrument will show the amount of duplicates so you can review them.
\n“site” command in search<\/strong> – it is an effective method, but requires some work. Consists in searching the website for particular words or phrases such as products, if is an online store (eg site sample.com “product in the store”) In the results you can see if the titles and descriptions are duplicated .
\nScreaming Frog<\/strong> is a powerful tool that allows you to track your site for duplicate content, among others. What will matter are Page Title, Meta Description and H1 with Filter Duplicate.
\nGoogle Analytics<\/strong>– can find also the ratio of duplicate pages in Content -> Site content -> pages of destination. The key is to look at URLs and pages that receive less traffic than they should have.
\nWhere duplicate content is outside your site can use the command “site” to detect it, however there are tools as
Copyscape<\/a>. Other SEO tools that help detect duplicate content are Duplichecker, Plagiarism and Plagium.<\/p>\n

Eliminate duplicate content<\/h3>\n

Clearly, search engines do not like duplicate content, it leads to a poor user experience. So if your site has duplicate content, you need to do everything possible to eliminate it.<\/p>\n

These are the main options for solving the duplicate problem:<\/strong><\/p>\n

Uses Rel Canonical<\/strong> – The label “rel = canonical” was designed precisely to address this problem, so it is the best solution. It consists of a line of code in the <head> section of your HTML page.
\n301 Redirect<\/strong> – is the best thing when you cannot use the canonical tag, when you move content from one page to another.
\nDeny access to robot<\/strong>s – To prevent search engines to find duplicate pages you can help robots through robots.txt file
\nIn case of duplicate content offsite it is best to ask by email \u00a0the offenders to remove this content. If this does not work ask that at least have a link redirected to a page from where it is copied, so the search engine will get help to identify the original.<\/p>\n

As a last option, you can ask Google to remove the page in search results through a request based on US law protection of copyright (DMCA). You will also help improve search engine results by detecting duplicate content and\u00a0sending your case as an example.<\/p>\n

Further conclusions and some tips:<\/h3>\n

Never use the same description \/ title in more than one page<\/strong>
\nThe text of each page must be unique<\/strong>
\nDo not forget to use the canonical tags<\/strong>
\nWhen you copy a quote from another place always include a link directed to the original<\/strong>
\nIf you copy an entire page, ask permission before including a link to the source.<\/strong>[\/vc_column_text][\/vc_column][\/vc_row][vc_row][vc_column][\/vc_column][\/vc_row]<\/p>\n<\/div>","protected":false},"excerpt":{"rendered":"

[vc_row][vc_column][vc_video link=”https:\/\/youtu.be\/c4wQnwVtBZo” align=”center”][vc_separator border_width=”5″][vc_column_text] Duplicate content issues Hot to check for duplicate content Duplicate content is one of the most common SEO problems and, interestingly, one [\u2026]<\/span><\/p>\n","protected":false},"author":1,"featured_media":4240,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"rank_math_lock_modified_date":false,"footnotes":""},"categories":[408,304,302,309],"tags":[412,416,448,447,415],"class_list":["post-4236","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-content-marketing","category-marketing-advertising","category-seo","category-tips","tag-content-marketing","tag-duplicate-content","tag-duplicate-content-issues","tag-google-webmaster-central-office-hours-hangout","tag-seo"],"_links":{"self":[{"href":"https:\/\/xtremefreelance.com\/wp-json\/wp\/v2\/posts\/4236","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/xtremefreelance.com\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/xtremefreelance.com\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/xtremefreelance.com\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/xtremefreelance.com\/wp-json\/wp\/v2\/comments?post=4236"}],"version-history":[{"count":0,"href":"https:\/\/xtremefreelance.com\/wp-json\/wp\/v2\/posts\/4236\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/xtremefreelance.com\/wp-json\/wp\/v2\/media\/4240"}],"wp:attachment":[{"href":"https:\/\/xtremefreelance.com\/wp-json\/wp\/v2\/media?parent=4236"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/xtremefreelance.com\/wp-json\/wp\/v2\/categories?post=4236"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/xtremefreelance.com\/wp-json\/wp\/v2\/tags?post=4236"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}