FormatReaderMapping.Builder (Paimon : 1.2-SNAPSHOT API)

java.lang.Object
- org.apache.paimon.utils.FormatReaderMapping.Builder

Enclosing class:

FormatReaderMapping
```
public static class FormatReaderMapping.Builder
extends Object
```
Builder for FormatReaderMapping.

Constructor Summary

Constructors
Constructor and Description
`Builder(FileFormatDiscover formatDiscover, List<DataField> readTableFields, java.util.function.Function<TableSchema,List<DataField>> fieldsExtractor, List<Predicate> filters)`

Method Summary

All Methods Instance Methods Concrete Methods
Modifier and Type	Method and Description
`FormatReaderMapping`	`build(String formatIdentifier, TableSchema tableSchema, TableSchema dataSchema)` There are three steps here to build `FormatReaderMapping`:

Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait

- Constructor Detail
  - Builder
```
public Builder(FileFormatDiscover formatDiscover,
               List<DataField> readTableFields,
               java.util.function.Function<TableSchema,List<DataField>> fieldsExtractor,
               @Nullable
               List<Predicate> filters)
```
- Method Detail
  - build
```
public FormatReaderMapping build(String formatIdentifier,
                                 TableSchema tableSchema,
                                 TableSchema dataSchema)
```
    There are three steps here to build FormatReaderMapping:
    1. Calculate the readDataFields, which is what we intend to read from the data schema. Meanwhile, generate the indexCastMapping, which is used to map the index of the readDataFields to the index of the data schema.
    2. Calculate the mapping to trim _KEY_ fields. For example: we want _KEY_a, _KEY_b, _FIELD_SEQUENCE, _ROW_KIND, a, b, c, d, e, f, g from the data, but actually we don't need to read _KEY_a and a, _KEY_b and b the same time, so we need to trim them. So we mapping it: read before: _KEY_a, _KEY_b, _FIELD_SEQUENCE, _ROW_KIND, a, b, c, d, e, f, g read after: a, b, _FIELD_SEQUENCE, _ROW_KIND, c, d, e, f, g and the mapping is [0,1,2,3,0,1,4,5,6,7,8], it converts the [read after] columns to [read before] columns.
    3. We want read much fewer fields than readDataFields, so we kick out the partition fields. We generate the partitionMappingAndFieldsWithoutPartitionPair which helps reduce the real read fields and tell us how to map it back.

Back to Paimon Website

Class FormatReaderMapping.Builder

Constructor Summary

Method Summary

Methods inherited from class java.lang.Object

Constructor Detail

Builder

Method Detail

build

Back to Paimon Website