Getting into Google News: Quickstart guide
Step 2: Review technical Google News index guidelines
Google News uses a computer algorithm to automatically crawl news sites. To help our system determine which of your webpages are articles, make sure your site follows our technical guidelines.
Here are some common areas where publishers encounter technical issues:
- Article URLs. Make sure your article URLs are unique and permanent.
- Article format. Our crawler can’t crawl PDFs or other non-HTML formats.
- Robots.txt or metatags. For your content to be included in Google News, our user-agents Googlebot and Googlebot-news must be able to crawl and index your site.
- Multimedia content. We currently are unable to include audio files or multimedia content. However, we can sometimes crawl supplementary text on pages with this type of content and do include some videos from YouTube.