Dataset Attributes & Responsibilities

Dataset settings allow you to define key metadata, ownership, and business context for your datasets. This information supports data governance and accountability, and makes it easier to integrate with other tools such as your data catalog.

Dataset Owner

Purpose of the Dataset Owner

Each dataset should have a designated Dataset Owner: a person or team responsible for the dataset's quality, availability, and usage.

Typically, the role of a Dataset Owner includes:

  • Defining and maintaining the dataset's purpose and documentation.

  • Ensuring the dataset meets data quality standards and contract requirements.

  • Responding to issues, such as failed checks or data quality alerts.

  • Reviewing and approving changes to the dataset schema or contract.

Responsibilities

What are Responsibilities?

Responsibilities allow you to assign permissions to users or user groups, ensuring they have the access they need to work with a dataset.

A Responsibility is a combination of:

  • A User or User Group.

  • A Dataset Role, which is a predefined collection of permissions (such as the ability to edit contracts, view checks, or manage settings).

By assigning Responsibilities, you define who can do what for each dataset, supporting clear ownership, governance, and collaboration.
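The pairing described above can be sketched as a small data model. This is an illustrative sketch only, not Soda's implementation: the class names, the `permissions_for` helper, the "Viewer" and "Editor" roles, and the permission strings are all hypothetical, and group membership is not modeled.

```python
from __future__ import annotations

from dataclasses import dataclass


@dataclass(frozen=True)
class DatasetRole:
    """A named collection of permissions (permission names are hypothetical)."""
    name: str
    permissions: frozenset[str]


@dataclass(frozen=True)
class Responsibility:
    """Pairs a user or user group with a dataset role."""
    assignee: str  # user or user group identifier
    role: DatasetRole


def permissions_for(assignee: str, responsibilities: list[Responsibility]) -> frozenset[str]:
    """Union of the permissions from every role assigned to the assignee."""
    granted: set[str] = set()
    for responsibility in responsibilities:
        if responsibility.assignee == assignee:
            granted |= responsibility.role.permissions
    return frozenset(granted)


# Hypothetical roles and permission names, for illustration only.
viewer = DatasetRole("Viewer", frozenset({"view_checks"}))
editor = DatasetRole("Editor", frozenset({"view_checks", "edit_contracts"}))

assignments = [
    Responsibility("data-platform-team", editor),
    Responsibility("analyst@example.com", viewer),
]
```

Calling `permissions_for("data-platform-team", assignments)` returns the permissions of the hypothetical "Editor" role. An assignee with several roles receives the union of their permissions.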

Default Dataset Owner role

Every dataset has a default Dataset Owner role, automatically assigned to the designated Dataset Owner(s).

  • This role provides essential permissions to manage and maintain the dataset.

  • The Dataset Owner role cannot be removed, but it can be combined with other roles for additional permissions.

Data source responsibilities

Responsibilities can also be assigned at the data source level, mapping a user or user group to a data source role. A user must have a data source responsibility to see the datasets and checks onboarded from that data source.
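The visibility rule can be illustrated with a short sketch. It assumes that membership in an assigned user group counts as holding the data source responsibility. The function name, data shapes, and sample names are hypothetical and do not reflect how Soda Cloud evaluates access internally.

```python
from __future__ import annotations


def visible_datasets(
    user: str,
    groups: set[str],
    source_assignees: dict[str, set[str]],
    dataset_sources: dict[str, str],
) -> list[str]:
    """Return datasets whose data source has a responsibility for the user.

    Assumption: belonging to an assigned user group is enough.
    """
    identities = {user} | groups
    return sorted(
        dataset
        for dataset, source in dataset_sources.items()
        if identities & source_assignees.get(source, set())
    )


# Hypothetical data: who holds a responsibility on each data source.
source_assignees = {
    "warehouse": {"analytics-team"},
    "crm_db": {"sales-ops"},
}
# Hypothetical mapping of dataset name to the data source it was onboarded from.
dataset_sources = {"orders": "warehouse", "customers": "warehouse", "leads": "crm_db"}
```

In this sketch, a member of `analytics-team` sees only the datasets onboarded from `warehouse`, and a user with no data source responsibility sees none.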

Dataset Attributes

Only users with the Configure Dataset permission on the dataset can update its attributes. For details, see Global and Dataset Roles.

Purpose of Attributes

Dataset attributes allow you to add descriptive metadata to your datasets. This metadata can then be:

  • Used for filtering in Soda Cloud, making it easier to search and organize datasets and checks based on specific criteria (e.g., business domain, sensitivity, criticality).

  • Leveraged in reporting, enabling you to group datasets, track ownership, and monitor data quality across different categories or dimensions.

Adding meaningful attributes enhances discoverability, governance, and collaboration within Soda and its integrations.
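As a rough illustration of the two uses listed above, the following sketch filters and groups datasets by attribute values in plain Python. The attribute names (`domain`, `criticality`), the sample datasets, and both helper functions are hypothetical. In Soda Cloud itself, filtering and reporting are done in the product, not with this code.

```python
from __future__ import annotations


def filter_by_attributes(datasets: dict[str, dict[str, str]], **criteria: str) -> list[str]:
    """Return names of datasets whose attributes match every criterion."""
    return sorted(
        name
        for name, attrs in datasets.items()
        if all(attrs.get(key) == value for key, value in criteria.items())
    )


def group_by_attribute(datasets: dict[str, dict[str, str]], key: str) -> dict[str, list[str]]:
    """Group dataset names by the value of one attribute."""
    groups: dict[str, list[str]] = {}
    for name in sorted(datasets):
        value = datasets[name].get(key, "(unset)")
        groups.setdefault(value, []).append(name)
    return groups


# Hypothetical datasets and attribute values.
datasets = {
    "orders": {"domain": "sales", "criticality": "high"},
    "leads": {"domain": "sales", "criticality": "low"},
    "payroll": {"domain": "hr", "criticality": "high"},
}
```

For example, `filter_by_attributes(datasets, domain="sales", criticality="high")` narrows the list to a single dataset, while `group_by_attribute(datasets, "criticality")` produces the kind of grouping a report might use.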

Learn how to define attribute types:

Integrate with data catalog

You can automate the management of dataset attributes and responsibilities in Soda Cloud using our REST API. This allows you to:

  • Programmatically set or update attributes for multiple datasets.

  • Assign responsibilities (users, groups, and roles) to datasets at scale.

  • Keep your Soda Cloud configuration in sync with your data catalog or external metadata management systems.

This automation helps keep your metadata up to date and consistent across your ecosystem, which supports governance and discoverability.
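One way to approach such a sync is to compute which attributes differ between the catalog and Soda Cloud before sending any updates. The sketch below covers only that comparison step. The function name and data shapes are hypothetical, and the REST API calls are deliberately left out because their endpoints and payloads are not described on this page; consult the Soda Cloud API reference for those.

```python
from __future__ import annotations


def plan_attribute_updates(
    catalog: dict[str, dict[str, str]],
    current: dict[str, dict[str, str]],
) -> dict[str, dict[str, str]]:
    """Return, per dataset, only the attributes whose catalog value differs."""
    updates: dict[str, dict[str, str]] = {}
    for dataset, wanted in catalog.items():
        existing = current.get(dataset, {})
        changed = {key: value for key, value in wanted.items() if existing.get(key) != value}
        if changed:
            updates[dataset] = changed
    return updates


# Hypothetical catalog export and current state.
catalog = {
    "orders": {"domain": "sales", "criticality": "high"},
    "leads": {"domain": "sales"},
}
current = {
    "orders": {"domain": "sales", "criticality": "low"},
    "leads": {"domain": "sales"},
}

pending = plan_attribute_updates(catalog, current)
```

Here `pending` contains only the `criticality` change for `orders`, so an update would be sent for one dataset instead of all of them. Attributes that exist only in Soda Cloud are left untouched in this sketch.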

