Soda documentation v3

Soda is a data quality platform that provides tools to monitor, test, and improve data quality across all stacks.

Welcome to the Soda documentation hub, your one-stop resource for everything you need to know about Soda’s data quality platform. Dive into our guides, tutorials, reference materials, and integration pages to learn how keep your data quality fresh across your entire stack.

Take a sip of Soda

Soda v3 vs v4

Soda v3 is a checks-first, CLI-driven tool and it has limited anomaly detection features.

Soda v4 has incorporated collaborative data contracts and end-to-end observability tools to become a unified data-quality platform for all

Capability
Soda v3
Soda v4

Data Testing

(Checks)

CLI-centric checks written in YAML/SodaCL, run via the Python library or Agent.

Still supports YAML/SodaCL checks.

Adds full Data Testing workflows in both CLI and Web UI.

Data Observability

(Monitoring)

Anomaly dashboards provide threshold-based monitoring configured via Soda Cloud.

Metric Monitoring leverages an in-house anomaly detection algorithm to monitor data and metadata metric trends, and provide built-in alerts via Soda Cloud.

Data Contracts

File-based contracts executed via CLI/Git; verification via soda scan.

Collaborative data contracts: file-based and UI-based, executed via CLI/Git or the Soda Cloud UI.


📚 Guides & Tutorials

Learn core concepts and best practices:

🔌 Integrations

Extend Soda into your existing tools and workflows:

Integrations

📖 Reference

Detailed command, API, and configuration docs:

Reference


💬 Community & Support

Need help or want to contribute?


Still have questions? Use the search bar above or reach out through our community channels for additional help.

Last updated

Was this helpful?