Concept help - Data Set
A Data Set describes a record of data, including any location or time boundaries for the data, that has been captured and is available for use under a specific licence. A Data Set may be included in a Data Catalog, and can reference multiple Distributions that record different parts or formats of the data that are available to download.
DCAT defines a dataset as a "collection of data, published or curated by a single agent, and available for access or download in one or more formats". A dataset does not have to be available as a downloadable file. For example, a dataset that is available via an API can be defined as an instance of dcat:Dataset, and the API can be defined as an instance of dcat:Distribution. DCAT itself does not define properties specific to describing APIs; these are considered out of scope for this version of the vocabulary. They can, however, be defined in a profile of the DCAT vocabulary.
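As a minimal sketch of the API case described above, the following builds a JSON-LD description in which the dataset is a dcat:Dataset and the API is attached as a dcat:Distribution. The IRIs, the title, and the helper name `api_dataset_jsonld` are hypothetical and used only for illustration; this is not the registry's own export format.

```python
import json

# Hypothetical identifiers, used for illustration only.
DATASET_IRI = "https://example.org/dataset/air-quality"
API_IRI = "https://example.org/api/air-quality"


def api_dataset_jsonld(dataset_iri, api_iri, title):
    """Describe a dataset whose only distribution is an API endpoint."""
    return {
        "@context": {
            "dcat": "http://www.w3.org/ns/dcat#",
            "dct": "http://purl.org/dc/terms/",
        },
        "@id": dataset_iri,
        "@type": "dcat:Dataset",
        "dct:title": title,
        # A dataset may list several distributions; here there is one.
        "dcat:distribution": [
            {
                "@id": api_iri + "#distribution",
                "@type": "dcat:Distribution",
                # accessURL points at the API rather than a downloadable file.
                "dcat:accessURL": {"@id": api_iri},
            }
        ],
    }


record = api_dataset_jsonld(DATASET_IRI, API_IRI, "Air quality readings")
print(json.dumps(record, indent=2))
```

Additional distributions (for example a CSV download of the same data) could be appended to the `dcat:distribution` list in the same way.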
Fields available on this metadata type
| Field | ISO definition |
|---|---|
| Name | The primary name used for human identification purposes. |
| Definition | Representation of a concept by a descriptive statement which serves to differentiate it from related concepts. (3.2.39) |
| Is Federated | |
| Is Not Federable | |
| Version | Unique version identifier of this metadata item. |
| References | Significant documents that contributed to the development of the metadata item which were not the direct source for the metadata content. |
| Origin | The source (e.g. document, project, discipline or model) for the item (8.1.2.2.3.5) |
| Comments | Descriptive comments about the metadata item (8.1.2.2.3.4) |
| Deleted | The date after which the item has been soft deleted and is no longer visible in the registry |
| License | Information about the license document under which the dataset is made available. |
| Rights | Information about rights held in and over the dataset. |
| Release Date | Date of formal publication of the dataset. |
| Modification Date | Most recent date on which the dataset was changed, updated or modified. |
| Frequency | The frequency at which the dataset is published. |
| Spatial Coverage | Spatial or geographic coverage of the dataset. |
| Temporal Coverage | The temporal or time period that the dataset covers. |
| Catalog | An entity responsible for making the dataset available. |
| Landing Page | A Web page that can be navigated to in a Web browser to gain access to the dataset, its distributions and/or additional information. |
| Contact Point | Relevant contact information for the Dataset. |
| Conforming Specification | An established standard to which the described resource conforms. |
| Item Base | |
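Several of the fields in the table above correspond to properties that DCAT uses on a dataset. The sketch below shows one possible way to rename field labels to those terms. The mapping is an assumption made for illustration (in particular Name to dct:title and Definition to dct:description), and `to_dcat` is a hypothetical helper, not part of the registry.

```python
# Assumed mapping from the field labels above to DCAT / Dublin Core terms.
# The registry's actual serialisation may differ.
FIELD_TO_DCAT = {
    "Name": "dct:title",
    "Definition": "dct:description",
    "License": "dct:license",
    "Rights": "dct:rights",
    "Release Date": "dct:issued",
    "Modification Date": "dct:modified",
    "Frequency": "dct:accrualPeriodicity",
    "Spatial Coverage": "dct:spatial",
    "Temporal Coverage": "dct:temporal",
    "Landing Page": "dcat:landingPage",
    "Contact Point": "dcat:contactPoint",
    "Conforming Specification": "dct:conformsTo",
}


def to_dcat(fields):
    """Rename known field labels to DCAT terms.

    Returns (mapped, unmapped): a dict of term -> value for non-empty
    mapped fields, and a list of labels that have no assumed mapping.
    """
    mapped, unmapped = {}, []
    for label, value in fields.items():
        term = FIELD_TO_DCAT.get(label)
        if term is None:
            unmapped.append(label)
        elif value not in (None, ""):
            mapped[term] = value
    return mapped, unmapped


mapped, unmapped = to_dcat({
    "Name": "Air quality readings",
    "Release Date": "2024-01-15",
    "Rights": "",       # empty values are skipped
    "Version": "1.0",   # registry-specific, no assumed DCAT term
})
print(mapped)
print(unmapped)
```

Registry-specific fields such as Version, Origin or Deleted are reported as unmapped rather than forced onto a DCAT property.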
Custom Fields
| Field | Short definition | Long definition |
|---|---|---|
| Data Custodian | ||
| File Size | ||
| Technical Contact Point | A person or a team responsible for the technical aspects and management of a dataset. | |
| Publisher | An entity or individual responsible for making a dataset available to the public or a specific audience. | |
| Sensitivity | ||
| Security Classification | ||
| Keyword | ||
| Resource Type | ||
| Purpose | ||
| Sensitive Data | ||
| Legal Authority | ||
| Disposal | ||
| Data Status | ||
| File size | ||
| Format | ||
| Language | ||
| Publisher | ||
| Sensitivity Level | The level of security that should be applied to this data asset, as per the controlled list of options. In this field, "Low" means "does not include any aggregate or disaggregate personal information", "Medium" means "includes aggregated personal information only", and "High" means "includes row-level personal information that could identify individuals". | |
| Keywords | ||
| Admin only field | | |
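The Sensitivity Level definition in the table above amounts to a small decision rule. A minimal sketch of that rule follows; the function name and its two boolean parameters are hypothetical, and the controlled list in a real registry may be configured differently.

```python
from enum import Enum


class SensitivityLevel(Enum):
    LOW = "Low"
    MEDIUM = "Medium"
    HIGH = "High"


def sensitivity_level(has_row_level_personal_info, has_aggregated_personal_info):
    """Pick a level following the Sensitivity Level definition.

    High:   row-level personal information that could identify individuals.
    Medium: aggregated personal information only.
    Low:    no aggregate or disaggregate personal information.
    """
    if has_row_level_personal_info:
        return SensitivityLevel.HIGH
    if has_aggregated_personal_info:
        return SensitivityLevel.MEDIUM
    return SensitivityLevel.LOW


print(sensitivity_level(False, True).value)
```

Row-level information is checked first, so a dataset that contains both row-level and aggregated personal information is classified as High.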
Official Definition
A representation of a dataset in a catalog. Data Catalog Vocabulary (DCAT): 5.3 Class: Dataset