public class OrcReaderFactory extends Object implements FormatReaderFactory
ColumnarRow
records.FormatReaderFactory.Context
Modifier and Type | Field and Description |
---|---|
protected int |
batchSize |
protected List<OrcFilters.Predicate> |
conjunctPredicates |
protected boolean |
deletionVectorsEnabled |
protected org.apache.hadoop.conf.Configuration |
hadoopConfig |
protected org.apache.orc.TypeDescription |
schema |
protected RowType |
tableType |
Constructor and Description |
---|
OrcReaderFactory(org.apache.hadoop.conf.Configuration hadoopConfig,
RowType readType,
List<OrcFilters.Predicate> conjunctPredicates,
int batchSize,
boolean deletionVectorsEnabled) |
Modifier and Type | Method and Description |
---|---|
static org.apache.orc.Reader |
createReader(org.apache.hadoop.conf.Configuration conf,
FileIO fileIO,
Path path,
FileIndexResult fileIndexResult) |
org.apache.paimon.format.orc.OrcReaderFactory.OrcVectorizedReader |
createReader(FormatReaderFactory.Context context) |
org.apache.paimon.format.orc.OrcReaderFactory.OrcReaderBatch |
createReaderBatch(Path filePath,
org.apache.hadoop.hive.ql.exec.vector.VectorizedRowBatch orcBatch,
Pool.Recycler<org.apache.paimon.format.orc.OrcReaderFactory.OrcReaderBatch> recycler)
Creates the
OrcReaderBatch structure, which is responsible for holding the data
structures that hold the batch data (column vectors, row arrays, ...) and the batch
conversion from the ORC representation to the result format. |
protected final org.apache.hadoop.conf.Configuration hadoopConfig
protected final org.apache.orc.TypeDescription schema
protected final RowType tableType
protected final List<OrcFilters.Predicate> conjunctPredicates
protected final int batchSize
protected final boolean deletionVectorsEnabled
public OrcReaderFactory(org.apache.hadoop.conf.Configuration hadoopConfig, RowType readType, List<OrcFilters.Predicate> conjunctPredicates, int batchSize, boolean deletionVectorsEnabled)
hadoopConfig
- the hadoop config for orc reader.conjunctPredicates
- the filter predicates that can be evaluated.batchSize
- the batch size of orc reader.public org.apache.paimon.format.orc.OrcReaderFactory.OrcVectorizedReader createReader(FormatReaderFactory.Context context) throws IOException
createReader
in interface FormatReaderFactory
IOException
public org.apache.paimon.format.orc.OrcReaderFactory.OrcReaderBatch createReaderBatch(Path filePath, org.apache.hadoop.hive.ql.exec.vector.VectorizedRowBatch orcBatch, Pool.Recycler<org.apache.paimon.format.orc.OrcReaderFactory.OrcReaderBatch> recycler)
OrcReaderBatch
structure, which is responsible for holding the data
structures that hold the batch data (column vectors, row arrays, ...) and the batch
conversion from the ORC representation to the result format.public static org.apache.orc.Reader createReader(org.apache.hadoop.conf.Configuration conf, FileIO fileIO, Path path, FileIndexResult fileIndexResult) throws IOException
IOException
Copyright © 2023–2024 The Apache Software Foundation. All rights reserved.