Excluding non-news content on your site
We try to include as many different types of sites as possible in Google News, but we only include articles that offer timely reporting on matters that are important or interesting to our audience. Generally, we don't include how-to articles, classified ads, job postings, promotional content or strictly informational articles like weather forecasts or stock data.
Our system isn't always able to tell these different types of content apart. If your site mixes news content with other types of content, we may not be able to include any of your articles on Google News. There are a few options you may want to consider so that you share only your news content with us:
- Place all of your non-news content on a different host or directory, such as http://jobs.example.com or http://example.com/jobs, so that our crawler will be able to distinguish between your different content types. Also be careful how you link from one part of your site to another: when our crawler scans a section with news content, it may follow links to other content that doesn't meet our guidelines for inclusion.
- Set up a robots.txt file to prevent Googlebot-News from accessing any directory with non-news content. Alternatively, you can use META tags to block our crawler from following links to specific pages.
- Create a Google News Sitemap for your news articles only. You'll need to let us know that you'd prefer to have your articles crawled exclusively through your sitemap so that we can make the configuration change on our side.
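As an illustration of the robots.txt option above, here is a minimal sketch in Python that checks a draft robots.txt before you publish it. The /jobs/ directory follows the example in the list; the /classifieds/ directory, the sample URLs, and the is_allowed helper are hypothetical. Python's standard-library parser only approximates how a crawler reads the file, so treat the result as a sanity check rather than a guarantee of how Googlebot-News will behave.

```python
from urllib.robotparser import RobotFileParser

# Draft robots.txt: block only Googlebot-News from the non-news directories.
# /jobs/ mirrors the example above; /classifieds/ is a hypothetical addition.
ROBOTS_TXT = """\
User-agent: Googlebot-News
Disallow: /jobs/
Disallow: /classifieds/
"""


def is_allowed(user_agent: str, url: str, robots_txt: str = ROBOTS_TXT) -> bool:
    """Return True if robots_txt lets user_agent fetch url (hypothetical helper)."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    # Sample URLs are placeholders on example.com.
    for url in (
        "http://example.com/news/local-election.html",
        "http://example.com/jobs/editor.html",
        "http://example.com/classifieds/bikes.html",
    ):
        verdict = "allowed" if is_allowed("Googlebot-News", url) else "blocked"
        print(f"{verdict}: {url}")
```

Because the rules name only Googlebot-News, other crawlers are unaffected by this draft, so the non-news pages can still appear elsewhere while staying out of the news crawl.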
If you're able to make the necessary changes to allow us to crawl only your news content, please let us know and we'll be happy to review or update your site in our system.