Diagnostics Warehouse

Diagnostics Warehouse provides a clear, detailed view of the state of data checks while allowing access to failed rows in order to take a closer look and resolve data quality issues.

circle-info

Can't find what you're looking for? Get access to Diagnostics Warehouse private documentation:

  1. Log into Soda

  2. Refresh this page (Soda and docs must be open in the same browser)

Note: Your organization must be under an Enterprise license to view private docs. Learn more about Documentation access & licensing.

Overview

What is Diagnostics Warehouse?

The Diagnostics Warehouse is a place inside your data warehouse (in one or multiple schemas) reserved for Soda to store data quality information.


With Diagnostics Warehouse, Soda stores all scans, failed records, and historical data quality issues directly in your own data warehouse. Nothing is stored outside. This gives you the ability to run diagnostics, resolve issues, and see exactly why problems happen. You can go as deep as you need, from a single record to a full dataset.

Each time a Soda scan runs, Diagnostics Warehouse stores failed rows together with check, scan and dataset attributes and metadata. With that information, data teams can quickly diagnose and resolve issues at both row and dataset level. Additionally, Soda's Diagnostics Warehouse makes it easier for teams to build on top of Soda's outcomes to set up operation workflows, and connect to BI tools you already know and trust.

Features & capabilities

  • Full diagnostic information in one place, including attributes.

  • Faster root-cause analysis: jump from a failed check to the exact failed rows, affected datasets/columns, and prior history to see if it’s a one-off issue or a pattern.

  • Open & portable features: it’s just tables in your warehouse. Query with SQL, power dashboards, join with lineage, incident, or cost data, and automate workflows.

  • Security & Governance: Diagnostics Warehouse stores tables in your own warehouse, giving you full control over security, retention and access.

Security & governance

  • Data minimization: Diagnostics Warehouse stores metadata about runs and checks and, for row-level checks, it only stores failed rows when the option is enabled.

  • Warehouse residency: Diagnostics are not stored in Soda. They live in your analytics warehouse, respecting your access controls, encryption, and audit trails.

  • No destructive access: Soda does not require permissions to delete data. It only evaluates checks, stores metadata, and moves failed rows into a designated location. Cleanup and garbage-collection queries are provided, but Soda never drops tables itself.


Get started

  1. Enable Diagnostics Warehouse in your Soda data source settings.

  2. Grant the service identity permission to create and write to the Diagnostics Warehouse schema in your warehouse.

  3. Run your checks; Diagnostics Warehouse tables populate automatically.

  4. Query your warehouse and connect to your BI tools to start exploring.

Next: to enable Diagnostics Warehouse in your organization, reach out to Soda at [email protected]envelope.


circle-info

You are not logged in to Soda and are viewing the default public documentation. Learn more about Documentation access & licensing.

Last updated

Was this helpful?