Skip to main content

Incremental extraction

Incremental extraction is useful when you want to extract only the data that has changed since the last extraction. This is useful when you have a large dataset and only a small portion of the data changes frequently.

To enable incremental extraction, you need to provide a column that can be used to identify the new rows. This column is called the partitionColumn.

metadata/extract/my_extract_config.sl.yml
version: 1
extract:
connectionRef: "duckdb" # The database connection to use
jdbcSchemas:
- schema: "starbake"
tables:
- name: "order" # table names or "*" to extract all tables
fullExport: false # (optional) set to true to do a full export
partitionColumn: "order_id" # (optional) column to use for incremental extraction
...