First Row
This documentation is for an unreleased version of Apache Paimon. We recommend you use the latest stable version.

First Row #

By specifying 'merge-engine' = 'first-row', users can keep the first row of the same primary key. It differs from the deduplicate merge engine that in the first-row merge engine, it will generate insert only changelog.

first-row merge engine only supports none and lookup changelog producer. For streaming queries must be used with the lookup changelog producer.
  1. You can not specify sequence.field.
  2. Not accept DELETE and UPDATE_BEFORE message. You can config ignore-delete to ignore these two kinds records.
  3. Visibility guarantee: Tables with First Row engine, the files with level 0 will only be visible after compaction. So by default, compaction is synchronous, and if asynchronous is turned on, there may be delays in the data.

This is of great help in replacing log deduplication in streaming computation.

Edit This Page
Copyright © 2024 The Apache Software Foundation. Apache Paimon, Paimon, and its feather logo are trademarks of The Apache Software Foundation.