public class OrcReaderFactory extends Object implements FormatReaderFactory
ColumnarRow records.FormatReaderFactory.Context| Modifier and Type | Field and Description |
|---|---|
protected int |
batchSize |
protected List<OrcFilters.Predicate> |
conjunctPredicates |
protected boolean |
deletionVectorsEnabled |
protected org.apache.hadoop.conf.Configuration |
hadoopConfig |
protected org.apache.orc.TypeDescription |
schema |
protected RowType |
tableType |
| Constructor and Description |
|---|
OrcReaderFactory(org.apache.hadoop.conf.Configuration hadoopConfig,
RowType readType,
List<OrcFilters.Predicate> conjunctPredicates,
int batchSize,
boolean deletionVectorsEnabled) |
| Modifier and Type | Method and Description |
|---|---|
static org.apache.orc.Reader |
createReader(org.apache.hadoop.conf.Configuration conf,
FileIO fileIO,
Path path,
FileIndexResult fileIndexResult) |
org.apache.paimon.format.orc.OrcReaderFactory.OrcVectorizedReader |
createReader(FormatReaderFactory.Context context) |
org.apache.paimon.format.orc.OrcReaderFactory.OrcReaderBatch |
createReaderBatch(Path filePath,
org.apache.hadoop.hive.ql.exec.vector.VectorizedRowBatch orcBatch,
Pool.Recycler<org.apache.paimon.format.orc.OrcReaderFactory.OrcReaderBatch> recycler)
Creates the
OrcReaderBatch structure, which is responsible for holding the data
structures that hold the batch data (column vectors, row arrays, ...) and the batch
conversion from the ORC representation to the result format. |
protected final org.apache.hadoop.conf.Configuration hadoopConfig
protected final org.apache.orc.TypeDescription schema
protected final RowType tableType
protected final List<OrcFilters.Predicate> conjunctPredicates
protected final int batchSize
protected final boolean deletionVectorsEnabled
public OrcReaderFactory(org.apache.hadoop.conf.Configuration hadoopConfig,
RowType readType,
List<OrcFilters.Predicate> conjunctPredicates,
int batchSize,
boolean deletionVectorsEnabled)
hadoopConfig - the hadoop config for orc reader.conjunctPredicates - the filter predicates that can be evaluated.batchSize - the batch size of orc reader.public org.apache.paimon.format.orc.OrcReaderFactory.OrcVectorizedReader createReader(FormatReaderFactory.Context context) throws IOException
createReader in interface FormatReaderFactoryIOExceptionpublic org.apache.paimon.format.orc.OrcReaderFactory.OrcReaderBatch createReaderBatch(Path filePath, org.apache.hadoop.hive.ql.exec.vector.VectorizedRowBatch orcBatch, Pool.Recycler<org.apache.paimon.format.orc.OrcReaderFactory.OrcReaderBatch> recycler)
OrcReaderBatch structure, which is responsible for holding the data
structures that hold the batch data (column vectors, row arrays, ...) and the batch
conversion from the ORC representation to the result format.public static org.apache.orc.Reader createReader(org.apache.hadoop.conf.Configuration conf,
FileIO fileIO,
Path path,
@Nullable
FileIndexResult fileIndexResult)
throws IOException
IOExceptionCopyright © 2023–2025 The Apache Software Foundation. All rights reserved.