public class OrcReaderFactory extends Object implements FormatReaderFactory
ColumnarRow
records.FormatReaderFactory.Context
Modifier and Type | Field and Description |
---|---|
protected int |
batchSize |
protected List<OrcFilters.Predicate> |
conjunctPredicates |
protected org.apache.hadoop.conf.Configuration |
hadoopConfig |
protected org.apache.orc.TypeDescription |
schema |
Constructor and Description |
---|
OrcReaderFactory(org.apache.hadoop.conf.Configuration hadoopConfig,
RowType readType,
List<OrcFilters.Predicate> conjunctPredicates,
int batchSize) |
Modifier and Type | Method and Description |
---|---|
static org.apache.orc.Reader |
createReader(org.apache.hadoop.conf.Configuration conf,
FileIO fileIO,
Path path) |
org.apache.paimon.format.orc.OrcReaderFactory.OrcVectorizedReader |
createReader(FormatReaderFactory.Context context) |
org.apache.paimon.format.orc.OrcReaderFactory.OrcReaderBatch |
createReaderBatch(Path filePath,
org.apache.hadoop.hive.ql.exec.vector.VectorizedRowBatch orcBatch,
Pool.Recycler<org.apache.paimon.format.orc.OrcReaderFactory.OrcReaderBatch> recycler)
Creates the
OrcReaderBatch structure, which is responsible for holding the data
structures that hold the batch data (column vectors, row arrays, ...) and the batch
conversion from the ORC representation to the result format. |
protected final org.apache.hadoop.conf.Configuration hadoopConfig
protected final org.apache.orc.TypeDescription schema
protected final List<OrcFilters.Predicate> conjunctPredicates
protected final int batchSize
public OrcReaderFactory(org.apache.hadoop.conf.Configuration hadoopConfig, RowType readType, List<OrcFilters.Predicate> conjunctPredicates, int batchSize)
hadoopConfig
- the hadoop config for orc reader.conjunctPredicates
- the filter predicates that can be evaluated.batchSize
- the batch size of orc reader.public org.apache.paimon.format.orc.OrcReaderFactory.OrcVectorizedReader createReader(FormatReaderFactory.Context context) throws IOException
createReader
in interface FormatReaderFactory
IOException
public org.apache.paimon.format.orc.OrcReaderFactory.OrcReaderBatch createReaderBatch(Path filePath, org.apache.hadoop.hive.ql.exec.vector.VectorizedRowBatch orcBatch, Pool.Recycler<org.apache.paimon.format.orc.OrcReaderFactory.OrcReaderBatch> recycler)
OrcReaderBatch
structure, which is responsible for holding the data
structures that hold the batch data (column vectors, row arrays, ...) and the batch
conversion from the ORC representation to the result format.public static org.apache.orc.Reader createReader(org.apache.hadoop.conf.Configuration conf, FileIO fileIO, Path path) throws IOException
IOException
Copyright © 2023–2024 The Apache Software Foundation. All rights reserved.