Delta Lake Reader

FME can read attribute data stored in Delta Lake tables.

Delta Lake is an open-source storage layer that brings ACID (atomicity, consistency, isolation, durability) transaction capabilities to big data workloads. It is built on top of Apache Parquet and supports scalable, reliable data processing on data lakes. Delta Lake tables can be queried efficiently and incrementally, making them well-suited for analytical and batch processing use cases.

Delta Lake Product and System Requirements

Format

FME Platform

Operating System

Reader/Writer

FME Form

FME Flow

FME Flow Hosted

Windows 64-bit

Linux

Mac

Reader

Yes

Yes

Yes

Yes

Yes

Yes

Terminology: Delta Lake and FME

Delta Lake

FME

table

feature type

row

feature

column

attribute

Reader Overview

The Delta Lake reader reads a specified set of delta lake tables. Names and URLs of the tables must be defined in the connection parameters before they can be read.

Arbitrary WHERE clauses are fully supported per table.

 Using Delta Lake with the DatabaseQuerier Transformer in FME

Instead of using the Delta Lake reader directly, you can leverage its powerful SQL capabilities directly within the DatabaseQuerier transformer.

  • For example, you can use the DatabaseQuerier to efficiently perform analysis over several different Delta Lake tables.
  • By incorporating Delta Lake within the DatabaseQuerier, you can enhance your data processing workflows, taking full advantage of SQL's flexibility and Delta Lake’s performance.