public class OrcReaderFactory extends Object implements FormatReaderFactory
ColumnarRow records.FormatReaderFactory.Context| Modifier and Type | Field and Description |
|---|---|
protected int |
batchSize |
protected List<OrcFilters.Predicate> |
conjunctPredicates |
protected org.apache.hadoop.conf.Configuration |
hadoopConfig |
protected org.apache.orc.TypeDescription |
schema |
| Constructor and Description |
|---|
OrcReaderFactory(org.apache.hadoop.conf.Configuration hadoopConfig,
RowType readType,
List<OrcFilters.Predicate> conjunctPredicates,
int batchSize) |
| Modifier and Type | Method and Description |
|---|---|
static org.apache.orc.Reader |
createReader(org.apache.hadoop.conf.Configuration conf,
FileIO fileIO,
Path path) |
org.apache.paimon.format.orc.OrcReaderFactory.OrcVectorizedReader |
createReader(FormatReaderFactory.Context context) |
org.apache.paimon.format.orc.OrcReaderFactory.OrcReaderBatch |
createReaderBatch(Path filePath,
org.apache.hadoop.hive.ql.exec.vector.VectorizedRowBatch orcBatch,
Pool.Recycler<org.apache.paimon.format.orc.OrcReaderFactory.OrcReaderBatch> recycler)
Creates the
OrcReaderBatch structure, which is responsible for holding the data
structures that hold the batch data (column vectors, row arrays, ...) and the batch
conversion from the ORC representation to the result format. |
protected final org.apache.hadoop.conf.Configuration hadoopConfig
protected final org.apache.orc.TypeDescription schema
protected final List<OrcFilters.Predicate> conjunctPredicates
protected final int batchSize
public OrcReaderFactory(org.apache.hadoop.conf.Configuration hadoopConfig,
RowType readType,
List<OrcFilters.Predicate> conjunctPredicates,
int batchSize)
hadoopConfig - the hadoop config for orc reader.conjunctPredicates - the filter predicates that can be evaluated.batchSize - the batch size of orc reader.public org.apache.paimon.format.orc.OrcReaderFactory.OrcVectorizedReader createReader(FormatReaderFactory.Context context) throws IOException
createReader in interface FormatReaderFactoryIOExceptionpublic org.apache.paimon.format.orc.OrcReaderFactory.OrcReaderBatch createReaderBatch(Path filePath, org.apache.hadoop.hive.ql.exec.vector.VectorizedRowBatch orcBatch, Pool.Recycler<org.apache.paimon.format.orc.OrcReaderFactory.OrcReaderBatch> recycler)
OrcReaderBatch structure, which is responsible for holding the data
structures that hold the batch data (column vectors, row arrays, ...) and the batch
conversion from the ORC representation to the result format.public static org.apache.orc.Reader createReader(org.apache.hadoop.conf.Configuration conf,
FileIO fileIO,
Path path)
throws IOException
IOExceptionCopyright © 2023–2024 The Apache Software Foundation. All rights reserved.