Google’s duplicate detection algorithm is broken

Tomek Rudzki

4 min read

Published: November, 2022

Updated: January, 2026

Google recently changed something in its deduplication systems. And from what I’m seeing, that change was a mistake. In October, I started seeing multiple client websites reporting an increased number of “Duplicate, Google chose different canonical than user” pages. I know it’s not because these websites made significant changes on their end — they are […]

Google recently changed something in its deduplication systems. And from what I’m seeing, that change was a mistake.

In October, I started seeing multiple client websites reporting an increased number of “Duplicate, Google chose different canonical than user” pages.

I know it’s not because these websites made significant changes on their end — they are our clients at Onely. It must have been a change in how Google chooses the canonical URL when presented with multiple variants.

Why did I say this isn’t a positive change? See for yourself:

Unfortunately, I can’t share any specific pages for these websites. But just looking at the URL path, you can see that Google made a mistake. These are not the same:

If it was just one page, I wouldn’t be too worried — bugs happen. But take a look at this chart:

21 million pages were mistakenly marked as duplicates within the last month without any significant structural changes on the client’s end. These pages are now unindexed and don’t bring any traffic to the site. And Google’s shortcomings are costing this business millions of dollars:

At first, I thought this is related to Google’s October spam update, which rolled out on October 19th — coinciding with the sudden increase in the number of “Duplicate, Google chose different canonical than user” pages. But then I found other websites which were similarly affected weeks before that update rolled out. Here are two examples:

Worst of all, in most cases, there’s zero logic to the way Google chooses the canonical variant! The “Duplicate, Google chose different canonical than user” status is fairly common when you fail to differentiate product variants and don’t provide consistent canonical signals.

But in these cases, Google is choosing product A as the canonical page for product B. As in, Google chooses a product page for Samsung Galaxy S20 as the preferred canonical instead of a product page for a JBL speaker! Again, zero logic.

For one of these other websites, Google canonicalized women’s clothing category to… men’s clothing.

We’re not sure what algorithms Google uses for duplicate content detection, but many of them operate on common phrases. For instance, both men’s and women’s category pages contain T-shirts, sportswear, and jeans. But this doesn’t mean they are duplicates. Not even close.

What does this mean?

Google has had similar issues in the past and this might eventually be resolved. But for now, there is no indication from Google that they know about the problem and are working on a solution.

For now, you should remember the following:

Google may have deindexed URLs on your site thinking they are duplicates, even when they are not.
Check your Page indexing report in Google Search Console and see if the number of URLs reported as “Duplicate, Google chose a different canonical than user” or “Duplicate without user-selected canonical” recently spiked.

Your next steps are:

Inspect the pages that you feel are important for your business.
Find examples when Google chose the wrong canonical.
Add unique content to let Google see the page has changed.
Request indexing in Google Search Console.

Know when Google deindexes your pages

We know Google may deindex some pages to free up space for better-quality docs.

We also noticed that Google deindexes many pages during core updates. Ziemek Bućko from Onely wrote an article about this happening after one of the recent updates.

My solution for this is — use ZipTie.

ZipTie can actively monitor if your existing pages remain indexed over time.

You can easily see which URLs got deindexed by Google.

Then, you can still use Google Search Console to figure out why Google may have deindexed them. But without ZipTie, you risk being surprised by your traffic steadily dropping as your pages are getting deindexed without you even knowing.

Tomek Rudzki

Author

Tomek is a co-founder of ZipTie.dev and specializes in AI search optimization and SEO. He regularly shares his insights about AI search on our blog and wrote the ebook "AI Survival for SEO."

August 2025

What are the unique features of ZipTie.dev?

Most AI search tracking tools tell you where you rank but leave you guessing about how to improve - forcing you to figure out optimization strategies on your own. ZipTie is the only platform that combines comprehensive monitoring across Google AI Overviews, ChatGPT, and Perplexity with a built-in content optimization module that provides specific, actionable recommendations for improving your AI search performance. This guide breaks down the unique features that separate ZipTie from basic tracking tools and explains how each one helps you win more visibility in AI-powered search results.

July 2025

3 Steps to Optimize for AI Search Using ZipTie

Most businesses know they need to optimize for AI search but have no systematic process for actually doing it - leaving them to guess at what changes might improve their visibility. ZipTie's content optimization feature analyzes what ChatGPT, Perplexity, and Google AI Overviews actually require, identifies specific gaps in your existing content, and provides actionable recommendations to fix them. This step-by-step guide shows you how to use ZipTie to transform underperforming content into material that earns both citations and brand mentions across major AI search platforms.

May 2025

GSC’s Huge Search Gap

Google Search Console is hiding approximately 50% of your search traffic as "anonymous queries" - leaving you blind to the conversational searches that increasingly drive visitors to your site. Through systematic testing, I've confirmed that GSC fails to track most long-tail, conversational queries until they reach a certain popularity threshold, and even then it only reports data forward from that point. This growing blind spot means you're making strategic decisions about content and optimization based on incomplete data that misses the actual questions your audience is asking.

March 2025

Are Google AI Overviews common in the United Kingdom?

ZipTie just rolled out AI Overviews monitoring to seven new countries – the UK, Australia, Canada, India, Brazil, Japan, and Singapore. This got me wondering: how often do these AI Overviews actually pop up in the UK compared to other places? Since AI Overviews can totally change how people interact with search results, it’s worth […]

December 2024

State Of AI Overviews. 5 Disruptions Found After Analyzing 500k Queries

Google AI Overviews is the most controversial and anxiety-provoking change in search. It is already a top focal point for businesses relying on organic search. We have seen quick adoption of an AI search experience, resulting in the quick rollout of AI Overview, even expanding to 100 other countries and territories recently. With so much […]

November 2024

Entering the revolution of AI Search Engines

AI-powered search engines are changing how we find information online. It’s no longer some toys for geeks. Gartner predicts AI search will capture 25% of the traditional search market by the end of 2025. What is happening now is the rapid development of AI search engines Plus, just a few days ago, there was news […]

14-Day Free Trial

Get full access to all features with no strings attached.

Google’s duplicate detection algorithm is broken

What does this mean?

Know when Google deindexes your pages

Tomek Rudzki

Related content

What are the unique features of ZipTie.dev?

3 Steps to Optimize for AI Search Using ZipTie

GSC’s Huge Search Gap

Are Google AI Overviews common in the United Kingdom?

State Of AI Overviews. 5 Disruptions Found After Analyzing 500k Queries

Entering the revolution of AI Search Engines

14-Day Free Trial