# Soda Cloud - Release notes

{% if visitor.claims.plan ===  %}
{% hint style="success" %}
You are **logged in to Soda** and seeing the **Free license** documentation. Learn more about [Documentation access & licensing](/reference/documentation-access-and-licensing.md).
{% endhint %}
{% endif %}

{% if visitor.claims.plan ===  %}
{% hint style="success" %}
You are **logged in to Soda** and seeing the **Team license** documentation. Learn more about [Documentation access & licensing](/reference/documentation-access-and-licensing.md).
{% endhint %}
{% endif %}

{% if visitor.claims.plan ===  %}
{% hint style="success" %}
You are **logged in to Soda** and seeing the **Enterprise license** documentation. Learn more about [Documentation access & licensing](/reference/documentation-access-and-licensing.md).
{% endhint %}
{% endif %}

{% if !(visitor.claims.plan ===  %}
{% hint style="info" %}
You are **not logged in to Soda** and are viewing the default public documentation. Learn more about [Documentation access & licensing](/reference/documentation-access-and-licensing.md).
{% endhint %}
{% endif %}

{% if visitor.claims.plan ===  %}
{% columns %}
{% column width="25%" %} <sup>20th March, 2026</sup>
{% endcolumn %}

{% column width="75%" %}
**Diagnostics Warehouse**

* Guardrails: Guardrails can now be configured [directly on a data source](broken://pages/E1KTiEL4c6zytpedbEZg#set-up-diagnostics-warehouse-for-your-data-source), providing a shared baseline for all its datasets. Any rule configured at the dataset level continues to override the corresponding data source rule for that dataset individually.
  {% endcolumn %}
  {% endcolumns %}
  {% endif %}

{% columns %}
{% column width="25%" %} <sup>13th March, 2026</sup>
{% endcolumn %}

{% column width="75%" %}

* [**IP allowlisting**](/organization-and-admin-settings/general-settings.md#ip-allowlist)**:** Organization admins can now restrict access to Soda Cloud to a defined set of IP addresses or CIDR ranges. When enabled, any request originating from an IP not on the allowlist is blocked, regardless of credentials. Configurable under **Organization Settings > Organization**.
* **Filter check results by attribute** *(private preview)*: On the dataset page's [**Checks**](/manage-issues/dataset-dashboard.md#checks-tab) tab, you can now filter checks by check attribute, last result, column, or check name. The health score panel updates dynamically to reflect only the filtered subset, and filtered views can be saved as collections.
* **Bulk attribute assignment in the contract editor** *(private preview)*: In the [contract editor](/data-testing/cloud-managed-data-contracts/author-a-contract-in-soda-cloud.md), you can now filter checks and assign attributes **in bulk** to all matching checks at once, instead of editing each check individually.
  {% endcolumn %}
  {% endcolumns %}

{% columns %}
{% column width="25%" %} <sup>6th March, 2026</sup>
{% endcolumn %}

{% column width="75%" %}

* [**Service accounts**](/organization-and-admin-settings/service-accounts.md)**:** Enabled service accounts, which provide non-human identities for automated pipelines, CI/CD systems, and API integrations, allowing teams to authenticate programmatically without tying access to an individual user.
* **Configurable** [**discovery scan schedule**](/onboard-data-sources-and-datasets/onboard-datasets-on-soda-cloud.md#configure-discovery-scan)**:** Discovery scans automatically detect new datasets and schema changes in your connected data sources.
  * The discovery scan schedule is configurable per data source.
  * The default discovery scan frequency was reduced from hourly to **once per day at a random time**. All existing discovery scans that were still on the hourly schedule were automatically migrated to daily. Scans that had already been manually configured to a non-hourly cadence were left untouched.
    {% endcolumn %}
    {% endcolumns %}

{% columns %}
{% column width="25%" %} <sup>27th February, 2026</sup>
{% endcolumn %}

{% column width="75%" %}

* Contract Copilot: added `column_expression` support, which lets Copilot generate checks against nested fields and mismatched types.
* Contract Autopilot: new feature (experimental, available upon request). Bootstraps a fully populated data contract from profiling + sampled data, for datasets that don't yet have a contract. Supported on Snowflake, Databricks, and Postgres.

{% if visitor.claims.plan ===  %}
**Diagnostics Warehouse**

* Implemented guardrails to give users control over how many failed rows the Diagnostics Warehouse stores per scan, preventing runaway storage costs from misconfigured or large-scale check failures.
  {% endif %}
  {% endcolumn %}
  {% endcolumns %}

{% columns %}
{% column width="25%" %} <sup>6th February, 2026</sup>
{% endcolumn %}

{% column width="75%" %}

* Updated dashboard filters and views on the [organization dashboard](/manage-issues/organization-dashboard.md)
  {% endcolumn %}
  {% endcolumns %}

{% columns %}
{% column width="25%" %} <sup>28th January, 2026</sup>
{% endcolumn %}

{% column width="75%" %}
**Everything new in Soda 4.0**

We are introducing a new data contracts engine and a unified cloud platform that brings **observability**, **AI**, and **data quality enforcement** together. This new version of Soda has transformed the software into a full data-quality platform by layering on **end-to-end data observability** and **collaborative data contracts**.

This marks the shift from a CLI-centric checks engine toward a unified, observability-driven data quality platform with a refined, three-tier **Core + Agent + Cloud** architecture, built-in contracts, orchestration, and deep integrations.

**Soda Core 4.0**

Data Contracts Engine: An **open-source engine** that formalizes the standard for defining and executing data contracts. A clean, data-quality–first syntax supporting **50+ built-in check types**.

**Soda Cloud 4.0**

Unified, self-driving data quality platform: It unites AI-powered contract generation, feedback-driven anomaly detection, deep diagnostic capabilities, and a faster, cleaner interface into a single platform where quality rules write themselves and bad data gets isolated instantly.

***

**The improvements in this release are numerous. Here are some of the highlights:**<br>

* Implemented **new automated dataset discovery and onboarding** process to easily onboard datasets with metric monitoring
  * **Schema-less data source onboarding:** Connect data sources without requiring predefined schemas
  * Rules-based dataset onboarding: Define rules to automatically onboard datasets matching specific criteria
  * New dataset onboarding UI: Redesigned interface for a smoother onboarding experience
  * Faster and more efficient dataset onboarding: Performance improvements reduce onboarding time
* **New data testing capabilities:**
  * Introduced **Data Contracts** to formalize requirements and expectations of user datasets
    * Supported checks:
      * Dataset-level: row count, schema, freshness, duplicate, failed rows, custom metrics.
      * Column-level: missing, invalid, duplicate, aggregate, failed rows, custom metrics.
    * Available on **supported data sources**: PostgreSQL, Databricks, Snowflake, BigQuery, Athena, SQL Server, Dremio, Redshift, Oracle, Synapse and Fabric
  * **Schedule your contract verifications** with a Data Contract Schedule.
  * Introduced the **new Collaborative Authoring UI** that allows business and technical users to collaborate on Data Contracts.
  * Introduced **Contract Requests** to request and propose changes for Data Contracts.
    * Stay up-to-date with your requests and proposals with e-mail notifications.
  * Introduced **Automated Contract Generation** to kickstart your Data Contracts.
  * Introduced **Secret Manager** to securely store your data source connection credentials.
* **Powerful data observability:**
  * Introduced **AI-powered metrics observability** at scale.
    * Supported monitors:
      * Dataset-level: total row count, total row count change, last modification time, schema changes, partition row count, most recent timestamp.
      * Column-level: missing values percentage, duplicate percentage, count, unique count, most recent timestamp, sum, minimum, maximum, average, standard deviation, variance, first quartile (Q1), median (Q2), third quartile (Q3), average length, minimum length, maximum length.
      * Group column-level monitor by any column to get insights per segment.
    * Available on **supported data sources**: PostgreSQL, Databricks, Snowflake, BigQuery, Athena, SQL Server, Dremio, Redshift, Oracle, Synapse and Fabric
    * Set a **schedule** for your monitors: daily, hourly, and custom intervals.
    * **Fine-tune** metric monitoring:
      * Set threshold strategy, exclusion values, and sensitivity,
      * Give feedback to improve detection,
      * Create incidents.
  * Introduced **Cloud API** to fetch your observability metrics.
  * Introduced **programmatic configuration** of metric monitoring
  * Introduced **historical metric collection**: calculate past data quality metrics retroactively for up to 365 days.
  * Introduced **metric monitor pages** with **interactive plots** to understand and fine-tune your monitors.
* With the new **Diagnostics Warehouse**, Soda stores all scans, failed records, and historical data quality issues directly in your own data warehouse:
  * **Full diagnostic information** in one place, including attributes.
  * Faster **root-cause analysis**: jump from a failed check to the exact failed rows, affected datasets/columns, and prior history to see if it’s a one-off issue or a pattern.
  * **Open & portable features**: it’s just tables in your warehouse. Query with SQL, power dashboards, join with lineage, incident, or cost data, and automate workflows.
  * **Security & Governance**: Diagnostics Warehouse stores tables in your own warehouse, giving you full control over security, retention and access.
    {% endcolumn %}
    {% endcolumns %}


---

# Agent Instructions: Querying This Documentation

If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question.

Perform an HTTP GET request on the current page URL with the `ask` query parameter:

```
GET https://docs.soda.io/release-notes/soda-cloud-release-notes.md?ask=<question>
```

The question should be specific, self-contained, and written in natural language.
The response will contain a direct answer to the question and relevant excerpts and sources from the documentation.

Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.