Managing multi-regional and multilingual sites
If your site offers different content to users in different languages, countries, or regions, you can optimize Google Search results for your site.
- A multilingual website is any website that offers content in more than one language. For example, a Canadian business with English and French versions of its site. Google Search tries to find pages that match the language of the searcher.
- A multi-regional website is one that explicitly targets users in different countries. For example, a product manufacturer that ships to both Canada and the United States. Google Search tries to find the right locale page for the searcher.
Some sites are both multi-regional and multilingual: for example, a site might have different versions for the USA and for Canada, and both French and English versions of the Canadian content.
If you have identical content in multiple languages on your site, here are some tips for helping users (and Google Search) find the right page:
Use different URLs for different language versions
Google recommends using different URLs for each language version of a page rather than using cookies or browser settings to adjust the content language on the page.
If you use different URLs for different languages, use hreflang annotations to help Google search results link to the correct language version of a page.
If you prefer to dynamically change content or reroute the user based on language settings, be aware that Google might not find and crawl all your variations. This is because the Googlebot crawler usually originates from the USA. In addition, the crawler sends HTTP requests without setting
Accept-Language in the request header.
Tell Google about your different language versions
Google supports several different methods for labeling language or region variants of a page, including hreflang annotations and sitemaps. Mark your pages appropriately.
Make sure the page language is obvious
Google uses the visible content of your page to determine its language. We don’t use any code-level language information such as
lang attributes, or the URL. You can help Google determine the language correctly by using a single language for content and navigation on each page, and by avoiding side-by-side translations.
Translating only the boilerplate text of your pages while keeping the bulk of your content in a single language (as often happens on pages featuring user-generated content) can create a bad user experience if the same content appears multiple times in search results with various boilerplate languages.
Use robots.txt to block search engines from crawling automatically translated pages on your site. Automated translations don’t always make sense and could be viewed as spam. More importantly, a poor or artificial-sounding translation can harm your site’s perception.
Let the user switch the page language
If you have multiple versions of a page:
- Consider adding hyperlinks to other language versions of a page. That way users can click to choose a different language version of the page.
- Avoid automatic redirection based on the user’s perceived language. These redirections could prevent users (and search engines) from viewing all the versions of your site.
Use language-specific URLs
It’s fine to use localized words in the URL, or to use an Internationalized Domain Name (IDN). However, be sure to use UTF-8 encoding in the URL (in fact, we recommend using UTF-8 wherever possible) and remember to escape the URLs properly when linking to them.
You can target your website or parts of it to users in a single specific country speaking a specific language. This can improve your page rankings in the target country, but at the expense of results in other locales/languages.
To geotarget your site on Google:
- Page or site level: Use locale-specific URLs for your site or page.
- Page level: Use hreflang or sitemaps to tell Google which pages apply to which locations or languages.
- Site level: If your site has a generic top-level domain (for example, .com, .org, or .eu), specify your site's target locale using the International Targeting report. Don’t use this tool if your site targets more than a single country. For example, it would make sense to set the target as Canada for a site about restaurants in Montreal; it would not make sense to set the target as Canada if it also targets French speakers in France, Canada, and Mali.
Remember that geotargeting isn’t an exact science, so it's important to consider users who land on the "wrong" version of your site. One way to do this could be to show links on all pages for users to select their region and/or language of choice.
Consider using a URL structure that makes it easy to geotarget your site, or parts of it, to different regions. The following table describes your options:
|URL structure||Example URL||Pros||Cons|
|Subdomains with gTLD||de.example.com||
|Subdirectories with gTLD||example.com/de/||
How does Google determine a target locale?
Google relies on a number of signals to determine the best target audience for a page:
- A target locale specified using Search Console's International Targeting report. If you use a generic top-level domain (gTLD) and use a hosting provider in another country, we recommend using Search Console to tell us which country your site should be associated with (if you want to geotarget your site).
- Country-code top-level domain names (ccTLDs). These are tied to a specific country (for example .de for Germany, .cn for China), and therefore are a strong signal to both users and search engines that your site is explicitly intended for a certain country. (Some countries have restrictions on who can use ccTLDs, so be sure to do your research first.) We also treat some vanity ccTLDs (such as .tv, .me, etc.) as gTLDs, as we've found that users and webmasters frequently see these as being more generic than country-targeted (we don't have a complete list of such vanity ccTLDs that we treat as gTLDs, because such a list would change over time). See Google's list of gTLDs.
- hreflang statements, whether in tags, headers, or sitemaps.
- Server location (through the IP address of the server). The server location is often physically near your users and can be a signal about your site’s intended audience. Some websites use distributed content delivery networks (CDNs) or are hosted in a country with better webserver infrastructure, so it is not a definitive signal.
- Other signals. Other sources of clues as to the intended audience of your site can include local addresses and phone numbers on the pages, the use of local language and currency, links from other local sites, and/or the use of Google My Business (where available).
What Google doesn't do:
- Google crawls the web from different locations around the world. We do not attempt to vary the crawler source used for a single site in order to find any possible variations in a page. Therefore, any locale or language variations that your site exposes should be communicated to Google explicitly through the methods shown here (such as hreflang entries, ccTLDs, or explicit links).
- Google ignores locational meta tags (like
distribution) or geotargeting HTML attributes.
If you provide similar or duplicate content on different URLs in the same language as part of a multi-regional site (for instance, if both
example.com/de/ show similar German language content), you should pick a preferred version and use the rel=canonical element and hreflang tags to make sure that the correct language or regional URL is served to searchers.
Generic top-level domains (gTLDs) are domains that aren't associated with specific locations. If your site has a generic top-level domain such as .com, .org, or any of the domains listed below, and wants to target users in a particular geographic location, you should explicitly set a country target using one of the methods described previously.
Google treats the following as gTLDs that can be geotargeted in Search Console:
- Generic Top Level Domains (gTLDs): Unless a top level domain is registered as a country code top level domain (ccTLD) with ICANN, Google will treat any TLD that resolves through the IANA DNS root zone as a gTLD. Examples:
- and many more...
- Generic regional top-level domains: Although these domains are associated with a geographical region, they are generally treated as generic top-level domains (much like .com or .org):
- Generic Country Code Top Level Domains (ccTLDs): Google treats some ccTLDs (such as .tv, .me, etc.) as gTLDs, as we've found that users and webmasters frequently see these more generic than country-targeted. Here is a list of those ccTLDs (this list may change).