Contract Autopilot
Learn more about Contract Autopilot, a Soda AI-powered feature to create data contracts tailored to your data.
Contract Autopilot is currently available as a preview for Snowflake, Postgres, and Databricks data sources. Contact [email protected] for early access.
Contract Autopilot automatically generates initial data contracts for datasets with recommended data quality checks to accelerate onboarding in Soda Cloud. It is designed to help teams achieve fast time to value and iterate later.
Watch Contract Autopilot in action
Generate a contract with Autopilot
This feature requires the Manage contract permission on the select datasets. Learn more about permissions here: Dataset Attributes & Responsibilities
Generate
After selecting the datasets, Soda will provide a snapshot of which contracts will be generated, and a time estimate for generation. You can continue using Soda Cloud while contracts are being generated.
After reviewing what will be created, click on Generate [num] Contracts.

You can now view the generated contracts to review, edit or suggest changes.
This process will create a scan for each selected dataset. You can cancel the scan or review the execution logs in the Scans tab on the dataset, as well as on the general Scans page.
Successful scan
The generated contract is published
A blue checkmark icon indicates that it is an automatically generated contract
The icon will change to a green checkmark icon after there is a user edit, to indicate it is a user-generated contract..
A data contract history entry is published, indicating that Autopilot published a first version of the contract
[If configured] A "published contract" Webhook will be published (for example, to allow sync to Git)
Requirements & limitations
Autopilot is only available upon request. Contact us to enable it in your organization.
Autopilot is currently a Soda Cloud feature:
A Soda Agent is required
CLI or Python API are not available
APIs are not available
Data sources
Supported on Snowflake, Databricks, and Postgres connections
Contract Generation Scope
Autopilot can only generate data contracts for datasets that do not already have an existing data contract. Datasets with an existing contract are not eligible for contract generation via Autopilot.
Contract Autopilot is assistive and does not autonomously modify data or execute destructive operations. Users can review generated contracts before execution.
Privacy & security
Contract Autopilot is designed to minimize data exposure while generating meaningful contract recommendations.
The following data is used locally by Soda:
A sample of approximately 10,000 rows to compute profiling metrics
The following data is sent to Soda’s managed OpenAI API:
A sample of up to 100 rows
Computed profiling metrics
Data source type
Dataset schema
Access to Contract Autopilot is restricted to users with appropriate dataset permissions.
Learn more about Soda's AI security & privacy measures.
Next steps
After generation, you can:
Review generated checks
Validate assumptions with domain experts
Adjust schedules and thresholds
Iterate using Contract Copilot
Contract Autopilot vs. Create Contract
When you click Create Contract (or use the CLI generate command), Soda creates a contract skeleton based on the dataset schema. It includes columns only and no checks are added.
Contract Autopilot, on the other hand, analyzes sampled data and profiling metadata to automatically generate a fully populated contract with recommended checks.
You are not logged in to Soda and are viewing the default public documentation. Learn more about Documentation access & licensing.
Last updated
Was this helpful?


