An indexing study of 1.7 million pages identifies the biggest reason your pages are not indexed by Google: they are actively being removed from search results.
Yes, it does define what quality means in the context of not indexed pages.
Pages that we monitor that have the 3 indexing states (URL is unknown to Google, Crawled - currently not indexed and discovered - currently not indexed) are actively being removed by Google's index.
Usually because of "quality" issues.
This is what our tool and data have shown over the last 12 months for important pages. Especially on large websites.
This is fairly common knowledge now (though happy to see it confirmed again) the question though is how is your tool determining it was "quality" issues and what exact does that mean? There is no methodology or definitions above, would be far more helpful and impactful to have that information.
How are you defining quality / quality issues?
Thanks for the comment Joe.
Quality = Pages being actively removed or forgotten by Google based on 3 specific indexing states.
More here: https://indexinginsight.substack.com/i/159829185/quality-issues
This doesn't define what quality means. How specifically is your study determining if a page is or is not quality?
Yes, it does define what quality means in the context of not indexed pages.
Pages that we monitor that have the 3 indexing states (URL is unknown to Google, Crawled - currently not indexed and discovered - currently not indexed) are actively being removed by Google's index.
Usually because of "quality" issues.
This is what our tool and data have shown over the last 12 months for important pages. Especially on large websites.
Which is why I called this category quality.
This is fairly common knowledge now (though happy to see it confirmed again) the question though is how is your tool determining it was "quality" issues and what exact does that mean? There is no methodology or definitions above, would be far more helpful and impactful to have that information.