OrcReaderFactory (Paimon : 1.2-SNAPSHOT API)

java.lang.Object
- org.apache.paimon.format.orc.OrcReaderFactory

All Implemented Interfaces:

FormatReaderFactory
```
public class OrcReaderFactory
extends Object
implements FormatReaderFactory
```
An ORC reader that produces a stream of ColumnarRow records.

Nested Class Summary
- Nested classes/interfaces inherited from interface org.apache.paimon.format.FormatReaderFactory
  FormatReaderFactory.Context

Field Summary

Fields
Modifier and Type	Field and Description
`protected int`	`batchSize`
`protected List<OrcFilters.Predicate>`	`conjunctPredicates`
`protected boolean`	`deletionVectorsEnabled`
`protected org.apache.hadoop.conf.Configuration`	`hadoopConfig`
`protected boolean`	`legacyTimestampLtzType`
`protected org.apache.orc.TypeDescription`	`schema`
`protected RowType`	`tableType`

Constructor Summary

Constructors
Constructor and Description
`OrcReaderFactory(org.apache.hadoop.conf.Configuration hadoopConfig, RowType readType, List<OrcFilters.Predicate> conjunctPredicates, int batchSize, boolean deletionVectorsEnabled, boolean legacyTimestampLtzType)`

Method Summary

All Methods Static Methods Instance Methods Concrete Methods
Modifier and Type	Method and Description
`static org.apache.orc.Reader`	`createReader(org.apache.hadoop.conf.Configuration conf, FileIO fileIO, Path path, RoaringBitmap32 selection)`
`org.apache.paimon.format.orc.OrcReaderFactory.OrcVectorizedReader`	`createReader(FormatReaderFactory.Context context)`
`org.apache.paimon.format.orc.OrcReaderFactory.OrcReaderBatch`	`createReaderBatch(Path filePath, org.apache.hadoop.hive.ql.exec.vector.VectorizedRowBatch orcBatch, Pool.Recycler<org.apache.paimon.format.orc.OrcReaderFactory.OrcReaderBatch> recycler)` Creates the `OrcReaderBatch` structure, which is responsible for holding the data structures that hold the batch data (column vectors, row arrays, ...) and the batch conversion from the ORC representation to the result format.

Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait

Field Detail

hadoopConfig

protected final org.apache.hadoop.conf.Configuration hadoopConfig

schema

protected final org.apache.orc.TypeDescription schema

tableType
```
protected final RowType tableType
```

conjunctPredicates

protected final List<OrcFilters.Predicate> conjunctPredicates

batchSize
```
protected final int batchSize
```

deletionVectorsEnabled

protected final boolean deletionVectorsEnabled

legacyTimestampLtzType

protected final boolean legacyTimestampLtzType

Constructor Detail

OrcReaderFactory

public OrcReaderFactory(org.apache.hadoop.conf.Configuration hadoopConfig,
                        RowType readType,
                        List<OrcFilters.Predicate> conjunctPredicates,
                        int batchSize,
                        boolean deletionVectorsEnabled,
                        boolean legacyTimestampLtzType)

Parameters:: hadoopConfig - the hadoop config for orc reader.; conjunctPredicates - the filter predicates that can be evaluated.; batchSize - the batch size of orc reader.

Method Detail

createReader

public org.apache.paimon.format.orc.OrcReaderFactory.OrcVectorizedReader createReader(FormatReaderFactory.Context context)
                                                                               throws IOException

Specified by:: createReader in interface FormatReaderFactory
Throws:: IOException

createReaderBatch

public org.apache.paimon.format.orc.OrcReaderFactory.OrcReaderBatch createReaderBatch(Path filePath,
                                                                                      org.apache.hadoop.hive.ql.exec.vector.VectorizedRowBatch orcBatch,
                                                                                      Pool.Recycler<org.apache.paimon.format.orc.OrcReaderFactory.OrcReaderBatch> recycler)

Creates the OrcReaderBatch structure, which is responsible for holding the data structures that hold the batch data (column vectors, row arrays, ...) and the batch conversion from the ORC representation to the result format.

createReader

public static org.apache.orc.Reader createReader(org.apache.hadoop.conf.Configuration conf,
                                                 FileIO fileIO,
                                                 Path path,
                                                 @Nullable
                                                 RoaringBitmap32 selection)
                                          throws IOException

Throws:: IOException

Back to Paimon Website

Class OrcReaderFactory

Nested Class Summary

Nested classes/interfaces inherited from interface org.apache.paimon.format.FormatReaderFactory

Field Summary

Constructor Summary

Method Summary

Methods inherited from class java.lang.Object

Field Detail

hadoopConfig

schema

tableType

conjunctPredicates

batchSize

deletionVectorsEnabled

legacyTimestampLtzType

Constructor Detail

OrcReaderFactory

Method Detail

createReader

createReaderBatch

createReader

Back to Paimon Website