Delta Lake Reader Parameters

Database Connection

Table Name Mapping

As part of the Delta Lake connection, you must specify the URLs of the delta tables you wish to access, along with a name for each table. These names can be used to conveniently refer to delta table URLs when selecting which tables to read in the reader, and when writing SQL queries against a Delta Lake connection in the DatabaseExecutor.

Cloud Provider

As part of the Delta Lake connection, you must specify the cloud storage platform where the delta tables are hosted. Each choice requires additional configuration for authenticated access. The currently supported platforms are:

Amazon Web Services

AWS supports 3 credential types:

  1. AWS SDK Default Authentication – The AWS CLI default credential precedence (described here) will be used.

  2. SSO – The AWS CLI single-sign-on credentials are used. You must run aws sso login using the AWS CLI.

  3. Secret Key – An AWS IAM secret access key is used.

All credential types require the AWS region of the S3 bucket containing the delta table to be specified (e.g., us-west-1).

Azure

Azure supports 4 credential types:

  1. Azure SDK Default Authentication – Azure’s default authentication flow will be used.

  2. SSO – The Azure CLI single-sign-on credentials are used. You must run az login using the Azure CLI.

  3. Account Key – Specify the Azure storage account name and secret key.

  4. Connection String – Any Azure storage connection string can be used. Useful for high configuration.

Google Cloud Storage

GCS supports authentication via Access Key and Secret obtained from the Cloud Storage dashboard settings, under the Interoperability tab.

Constraints

Schema Attributes

Spatial