Dataset Schema Validation API

Dataset schema markup enables your data to appear in Google Dataset Search — a specialized search tool for finding datasets. Scientists, researchers, journalists, and developers use Google Dataset Search to find public data. Valid Dataset structured data requires a name and description, with distribution details that help users access the actual data files.

Required Properties

PropertyTypeDescription
namerich result
stringThe name of the dataset.
descriptionrich result
stringA summary describing the dataset's contents, purpose, and methodology.

Recommended Properties

Missing recommended properties generate warnings. They won't block validation, but rich result-tagged properties are needed for full Google rich result eligibility.

PropertyTypeDescription
url
URLThe URL of the dataset landing page.
creator
Person or OrganizationThe creator or author of the dataset.
license
URLThe license under which the dataset is released.
keywords
string arrayKeywords or tags that describe the dataset.
distribution
DataDownload arrayHow to access the dataset files. Each DataDownload should have contentUrl and encodingFormat.
variableMeasured
string or PropertyValueThe variable(s) measured in the dataset.
identifier
URL or PropertyValueA DOI or other identifier for the dataset.

Rich Result Eligibility

Rich result type: Dataset listing in Google Dataset Search

Requirements: Required: name and description. distribution with DataDownload objects (contentUrl and encodingFormat) allows direct data access.

Example valid Dataset JSON-LD

{
  "@context": "https://schema.org",
  "@type": "Dataset",
  "name": "Global Structured Data Adoption Survey 2026",
  "description": "A survey of 10,000 websites measuring Schema.org structured data adoption rates, type distribution, and rich result eligibility across industries.",
  "url": "https://example.com/datasets/structured-data-survey-2026",
  "creator": {
    "@type": "Organization",
    "name": "SchemaCheck Research"
  },
  "license": "https://creativecommons.org/licenses/by/4.0/",
  "keywords": [
    "schema.org",
    "structured data",
    "SEO",
    "rich results",
    "JSON-LD"
  ],
  "distribution": [
    {
      "@type": "DataDownload",
      "encodingFormat": "text/csv",
      "contentUrl": "https://example.com/datasets/structured-data-survey-2026.csv"
    },
    {
      "@type": "DataDownload",
      "encodingFormat": "application/json",
      "contentUrl": "https://example.com/datasets/structured-data-survey-2026.json"
    }
  ]
}

Validate via API

Validate this schema against Google's requirements in one API call:

curl "https://schemacheck.dev/api/v1/validate" \
  -H "x-api-key: YOUR_KEY" \
  -H "Content-Type: application/json" \
  -d '{"jsonld":{"@context":"https://schema.org","@type":"Dataset","name":"Global Structured Data Adoption Survey 2026","description":"A survey of 10,000 websites measuring Schema.org structured data adoption rates, type distribution, and rich result eligibility across industries.","url":"https://example.com/datasets/structured-data-survey-2026","creator":{"@type":"Organization","name":"SchemaCheck Research"},"license":"https://creativecommons.org/licenses/by/4.0/","keywords":["schema.org","structured data","SEO","rich results","JSON-LD"],"distribution":[{"@type":"DataDownload","encodingFormat":"text/csv","contentUrl":"https://example.com/datasets/structured-data-survey-2026.csv"},{"@type":"DataDownload","encodingFormat":"application/json","contentUrl":"https://example.com/datasets/structured-data-survey-2026.json"}]}}'

Official Documentation

Google Structured Data: Dataset

Validate your Dataset schema now

100 free validations/month. No credit card required.