Link Search Menu Expand Document

Connect Soda to Trino

Last modified on 26-Apr-24

For Soda to run quality scans on your data, you must configure it to connect to your data source.
To learn how to set up Soda and configure it to connect to your data sources, see Get started.

Connection configuration reference

Install package: soda-trino

Reference Trino documentation for assistance.

data_source my_datasource_name:
  type: trino
  host: 127.0.0.1
  port: "5432"
  username: simple
  password: simple_pass
  catalog: hive
  schema: public
  source: 
  http_headers: 
  client_tags: ["test","test2"]
Property Required Notes
type required Identify the type of data source for Soda.
host required Provide a host identifier.
port optional Provide a port identifier.
username required Consider using system variables to retrieve this value securely using, for example, ${TRINO_USER}.
password required Consider using system variables to retrieve this value securely using, for example, ${TRINO_PASSWORD}.
catalog required Provide an identifier for the catalog which contains schemas and which references a data source using a connector. See Catalog in the Trino documentation.
schema required Provide an identifier for the schema in which your dataset exists.
source optional  
http_headers optional Provide any HTTP headers as needed. See Trino documentation for details.
client_tags optional Provide a list of tag strings to identify Trino resource groups. See Trino documentation for details.

Test the data source connection

To confirm that you have correctly configured the connection details for the data source(s) in your configuration YAML file, use the test-connection command. If you wish, add a -V option to the command to returns results in verbose mode in the CLI.

soda test-connection -d my_datasource -c configuration.yml -V

Supported data types

Category Data type
text CHAR, VARCHAR
number NUMBER, INT, INTEGER, BIGINT, SMALLINT, TINYINT, BYTEINT, FLOAT, FLOAT4, FLOAT8, DOUBLE, DOUBLE PRECISION, REAL
time DATE, DATETIME, TIME, TIMESTAMP, TIMESTAMPT_LTZ, TIMESTAMP_NTZ, TIMESTAMP_TZ




Was this documentation helpful?

What could we do to improve this page?

Documentation always applies to the latest version of Soda products
Last modified on 26-Apr-24