/webmasters/community?hl=en
This content is likely not relevant anymore. Try searching or browse recent questions.
3/1/19
FAQ - Structured data markup for datasets 0 Recommended Answers 0 Replies 1007 Upvotes
1 Recommended Answer
$0 Recommended Answers
Last edited 9/19/19
Have a look before you post -- maybe the answer to your question is already here!
If not, kick off a new thread and choose "Structured Data" as the category on step 2.

Q: What is Google Dataset Search and how do I find out more?

There are several resources that help you learn more about Google Dataset Search:
Going forward, if you would like to receive occasional announcements about Dataset Search, subscribe to this mailing list.


Q: How do I add my dataset to Dataset Search?

If you have a web page that describes a dataset (or many such web pages), you need to do the following to have it included in Dataset Search:
  • [Required] Add metadata in schema.org to each page that describes a dataset (documentation).
  • Verify that the markup produces structured data that you expect in Structured Data Testing Tool
  • If you have multiple pages, create a sitemap and add that sitemap to your Search Console
If the page has been crawled but after a week or two you still don't see it in Dataset Search, please file a bug using the "Feedback" button.

Q: How do I see aggregated data about my datasets' visibility in Google Search?

[Sept. 2019] There is now a structured data report available in Search Console for Datasets. If you have it enabled in your account, you'll find it under the Enhancements sections. Source.

Q: Why is a specific dataset not showing up in Dataset search results?

Most likely, this happens because there is no structured data on the page that describes the datasets. To verify, copy the link for the page that you expect to see in Dataset search results, and put it into the Structured Data Testing Tool. If you do not see any "Dataset" on the right-hand side, this means there is no schema.org/Dataset markup on the page or it is incorrect. If you own the page, you can fix it (instructions) or you can contact the owner of the page.
Even if there is markup on the page, we may not have gotten to it yet. If you own the page, you can check the search console to check the crawl status.

Q: Is a contract required to have the results be listed in Dataset Search?

No. The implementation is structured data mark-up: schema.org, which is an open standard widely used around the web. Mark-up on sites is purely voluntary and partners can remove the mark-up at any time.

Q: Will my site rank higher if I implement this feature?

No, your result will maintain its organic ranking. However, implementing this mark-up is a requirement for the results to show up in Dataset Search.

Q: Can I expect a traffic increase?

As with Search in general, there are no traffic guarantees. However, we believe there will be more discovery of your content by users. And the users that click/tap through to your site will have a much higher intent.

Q: How do I delete a dataset from Dataset search results?

If you don't want a dataset to show up in Dataset search results, and you own the dataset page, simply delete the structured data for schema.org/Dataset on the page. Keep in mind that it might take some time (days or weeks, depending on the crawl schedule) for the changes to be reflected on the Dataset search side.

Q: Why is some information from structured-data markup now displayed in the Dataset search results?

The goal for the results pages is to provide our users with the most reliable and predictable experience across the data that we collect from thousands of repositories. While we use all the structured data in our product, the decision of what to display and how to display it is guided by many different factors. As owners of dataset repositories provide more structured high quality data describing their content, we will continue to enrich the result pages in the product.

Q: What tools are available to help with markup?

You can use the Markup Helper to generate sample JSON-LD markup for a page that describes a dataset. Simply select "Dataset" in the Markup Helper, put in an address for one of your pages, and select and tag different components.
When you already have structured data on the page, the Structured Data Testing Tool (SDTT) is useful to verify the data. You can also use this tool to look at pages from other sites for examples of markup.

Q: How should the publication DOI and Dataset DOI be specified in the markup?

{
"@context" : "http://schema.org/",
"@type" : "Dataset",
"identifier": "https://doi.org/10.5061/dryad.8nm16",
"citation":  "https://doi.org/10.1111/jav.01596"
}

We are not sure why SDTT marks this as a problem since the value for citation can be text, but please be assured that if you plan to include markup as shown in these examples, it should be okay.

An even better markup example would be:

{
"@context" : "http://schema.org/",
"@type" : "Dataset",
"identifier": "https://doi.org/10.5061/dryad.8nm16",
"citation":  {
 "@type" : "Article",
 "identifier" : "https://doi.org/10.1111/jav.01596",
}

as this example specifies what kind of DOI is being provided in the citation. Please note that if you cannot include “citation” property, just specifying the "identifier" property should be fine, but it should follow the same format as in the sample above.

Q: Will Google's crawler be loading JavaScript from pages when looking for markup?

The Structured Data Testing Tool and crawlers should be able to execute javascript and load the markup. That being said, if there are issues with the tool, it would be best to try to adjust the javascript such that the markup is visible on the tool as well. Otherwise, you are allowed to directly place your markup from the page into the structure data tool to verify that the markup is good.
It is also worth noting that it would be best to use the Structured Data Testing Tool provided by Google rather than a third party extension as our tool more closely follows our dataset guidelines.

Q: The “identifier” property is not appearing on the SDTT, why is that?

This is a known issue with the tool which will be addressed by our team in a future iteration. While the structure data testing tool is a great way to validate the syntax of your markup, it may throw errors or warning that may not need to be addressed. So long as the markup of the page source adheres to the guidelines provided in our documentation, there should be no issues.

Q: How should multiple authors be specified in the markup?

We highly recommend using the "citation" property to include this information. You can create an array of citations and order the authors how you see best fits for the dataset.

Q: How would you add acknowledgement for funding in the dataset markup?

Although it is currently not a listed property in our documentation, you can refer to details about the "funder" property on http://schema.org/Dataset.

Q: How would we raise any issues we have about the current Dataset properties or suggest new properties to be added to the feature?

We suggest going to https://github.com/schemaorg/schemaorg/issues to discuss any markup properties changes or issues that your team is seeing.

Q: Is there something formal about actually submitting the sitemap to search console versus  just having dataset content present on the web pages to be discovered organically?

The submission is to ensure that the sitemap will be crawled if has not already been crawled naturally. That being said there is some latency in crawling the pages so please do allow for some time for them to be detected.

Q: We noticed that the structured data tool seems to strip newlines from the output on the right hand side; is that correct?

So long as the new line characters are included in the "description" property for source code of the page, that should be fine.

Q: Is it possible to create a sitemap for a subsection of our web site?

You are allowed to create and submit a sitemap specific to a subsection of your website rather than a sitemap of the entire site. Once you have a sitemap ready, you can submit them to our Google search console.

Q: How would you go about adding multiple “GeoCoordinates” or “GeoShapes”?

You can use an array of values for the “spatialCoverage” property in order to specify multiple points or shapes or a combination of both. I have included a sample below as reference:

"spatialCoverage": [{
   "@type": "Place",
   "geo": {
     "@type": "GeoCoordinates",
     "latitude": 39.3280,
     "longitude": 120.1633
   }
 },
{
   "@type": "Place",
   "geo": {
     "@type": "GeoShape",
     "box": "39.3280 120.1633  40. 445 123.7878"
   }
 }
]


Q: Our company logo is not appearing or is appearing incorrectly next to our dataset search results, how can we adjust this?

There are several steps to getting the image to show up. You should be able to see your logo in the panel on the right hand side of main Google search results, similar to what you see if you search for a search such as [pangaea]. If you don't see the image or it is not the image that you want, you will need to follow the blog entry and this developer resource to update the image.
Details
No replies yet.
This question is locked and replying has been disabled.
Discard post? You will lose what you have written so far.
Write a reply
10 characters required
Failed to attach file, click here to try again.
Discard post?
You will lose what you have written so far.
Personal information found

We found the following personal information in your message:

This information will be visible to anyone who visits or subscribes to notifications for this post. Are you sure you want to continue?

A problem occurred. Please try again.
Create Reply
Edit Reply
Delete post?
This will remove the reply from the Answers section.
Notifications are off
Your notifications are currently off and you won't receive subscription updates. To turn them on, go to Notifications preferences on your Profile page.
Report abuse
Google takes abuse of its services very seriously. We're committed to dealing with such abuse according to the laws in your country of residence. When you submit a report, we'll investigate it and take the appropriate action. We'll get back to you only if we require additional details or have more information to share.

Go to the Legal Help page to request content changes for legal reasons.

Reported post for abuse
Unable to send report.
This reply is no longer available.
/webmasters/threads
//accounts.google.com/ServiceLogin
You'll receive email notifications for new posts at
Unable to delete question.
Unable to update vote.
Unable to update subscription.
You have been unsubscribed
Deleted
Unable to delete reply.
Removed from Answers
Marked as Recommended Answer
Removed recommendation
Undo
Unable to update reply.
Unable to update vote.
Thank you. Your response was recorded.
Unable to undo vote.
Thank you. This reply will now display in the answers section.
Link copied
Locked
Unlocked
Unable to lock
Unable to unlock
Pinned
Unpinned
Unable to pin
Unable to unpin
Marked
Unmarked
Unable to mark
Reported as off topic
/webmasters/profile/0?hl=en