May 16, 2020
Crawling 8.8 million pages a day, stuck in one section, searching for keywords. Other sections hurt.
They seem to be stuck in an infinite "loop" in one section mainly because the crawlers are searching an endless list of keywords ranging from "dog" to really random words like "rk6ddhw03o56de6" while also changing filters that narrow results. As you can imagine, most of these internal keyword searches return "0 results". This particular section only has 8k individual pages. How do we get them to move onto or at least divide their attention to our other sections which are actually far more important to us than the one they are stuck in since May 3rd? The section it's stuck in dropped 0.2 rank while all others have dropped between 3 - 20 ranks. We were on the first page of many our target keywords and that's no longer the case. We thought they'd naturally move on but it's been too long to ignore and the impact on our traffic is severe.
To make matters worse, or perhaps the main issue, is that we did not have noindex set on these pages until tonight. Haven't had it set for 10+ years. We now have the meta data properly setup to noindex keyword searches but face the problem of how to deindex the millions of 0 result pages. Is there anyway to deindex with a pattern?
For example: /section/$1?keywords=$2
Do we need to wait for crawlers to search through the massive list of keywords again to deindex what they searched for? From what we've read, adding a rule to robots.txt would stop them from searching the section but then they won't be able to see the noindex tag to remove the pages. Who knows how long it will be before they search through the keyword list again.
Ultimately, it's created a very skewed view of what our website is about. Having 23 million pages about one subject with the vast majority have 0 results vs the 5 million higher quality pages they knew about before the update is causing all our rankings to plummet.
We need help / advice. Mainly, how to deindex the problematic pages via the pattern described above.
Community content may not be verified or up-to-date. Learn more.
All Replies (5)