Incremental extraction

Incremental extraction is useful when you want to extract only the data that has changed since the last extraction. This is useful when you have a large dataset and only a small portion of the data changes frequently.

To enable incremental extraction, you need to provide a column that can be used to identify the new rows. This column is called the partitionColumn.

metadata/extract/my_extract_config.sl.yml
version: 1
extract:
  connectionRef: "duckdb" # The database connection to use
  jdbcSchemas:
    - schema: "starbake"
      tables:
        - name: "order"               # table names or  "*" to extract all tables
          fullExport: false           # (optional) set to true to do a full export
          partitionColumn: "order_id" # (optional) column to use for incremental extraction
  ...