Link Search Menu Expand Document

Soda Cloud architecture

scan-anatomy

Soda Cloud and Soda SQL work together to help you monitor your data and alert you when there is a data quality issue.

Installed in your environment, you use the Soda SQL command-line tool to scan data in your warehouses. Soda SQL uses a secure API to connect to Soda Cloud. When it completes a scan, it pushes the scan results to your Soda Cloud account where you can log in and examine the details in the web application. Notably, Soda SQL pushes metadata to Soda Cloud; by default all your data stays inside your private network. See Data security and privacy

When you create a monitor in Soda Cloud’s web application, Soda SQL uses the monitor settings to add new tests when it runs a scan on data in a specific warehouse. A monitor is essentially a way to create Soda SQL tests using the web application instead of adjusting scan YAML file contents directly in your Soda project directory.

Beyond creating them, you can use monitors to track data quality over time. Soda Cloud stores your scan results and prepares charts that represent the volume of failed tests in each scan. These visualizations of your scan results enable you to see where your data quality is improving or deteriorating over time.

Go further