public class OrcWriterFactory extends Object implements FormatWriterFactory
A factory that creates an ORC FormatWriter. The factory takes a user-supplied Vectorizer implementation to convert each element into a VectorizedRowBatch.

Constructor and Description |
---|
OrcWriterFactory(Vectorizer<InternalRow> vectorizer, Properties writerProperties, org.apache.hadoop.conf.Configuration configuration, int writeBatchSize) Creates a new OrcWriterFactory using the provided Vectorizer, ORC writer properties, Hadoop Configuration, and write batch size. |
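
As a rough illustration of the writerProperties argument, the sketch below builds a java.util.Properties object using only the JDK. The keys orc.compress and orc.stripe.size are standard ORC configuration names; which keys this factory actually honors, and how they interact with the compression argument of create, is an assumption here. The class and method names are hypothetical.

```java
import java.util.Properties;

public class OrcWriterPropertiesSketch {

    // Builds a Properties object intended for the writerProperties argument.
    // Assumption: the factory forwards these entries to ORC's WriterOptions.
    static Properties orcWriterProperties() {
        Properties props = new Properties();
        // Standard ORC key for the compression codec.
        props.setProperty("orc.compress", "ZLIB");
        // Standard ORC key for the stripe size in bytes (64 MiB here).
        props.setProperty("orc.stripe.size", String.valueOf(64L * 1024 * 1024));
        return props;
    }

    public static void main(String[] args) {
        Properties props = orcWriterProperties();
        // Print each configured entry; iteration order is not guaranteed.
        props.forEach((key, value) -> System.out.println(key + "=" + value));
    }
}
```

The resulting object would be passed as the second constructor argument, next to the Vectorizer, the Hadoop Configuration, and the write batch size.
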
Modifier and Type | Method and Description |
---|---|
FormatWriter | create(PositionOutputStream out, String compression) Creates a writer that writes to the given stream. |
protected OrcFile.WriterOptions | getWriterOptions() |
public OrcWriterFactory(Vectorizer<InternalRow> vectorizer, Properties writerProperties, org.apache.hadoop.conf.Configuration configuration, int writeBatchSize)
vectorizer
- The vectorizer implementation that converts each input record to a VectorizedRowBatch.
writerProperties
- Properties that can be used in ORC WriterOptions.

public FormatWriter create(PositionOutputStream out, String compression) throws IOException
Description copied from interface FormatWriterFactory: creates a writer that writes to the given stream.
Specified by create in interface FormatWriterFactory.
out
- The output stream to write the encoded data to.
compression
- The compression value.
IOException
- Thrown if the writer cannot be opened, or if the output stream throws an exception.

@VisibleForTesting protected OrcFile.WriterOptions getWriterOptions()
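
The class description says a Vectorizer converts elements into a row batch, and the constructor takes a writeBatchSize. A minimal sketch of how those two pieces could fit together follows, assuming the writer flushes a batch once it holds writeBatchSize rows. Every type here (RowBatch, Vectorizer, BatchingWriter) is a hypothetical stand-in written against the JDK only, not the library's own classes, and the flush policy is an assumption rather than documented behavior.

```java
import java.util.ArrayList;
import java.util.List;

public class BatchingWriterSketch {

    /** Hypothetical stand-in for a row batch with a fixed capacity. */
    static final class RowBatch {
        final String[] rows;
        int size;

        RowBatch(int capacity) {
            rows = new String[capacity];
        }

        boolean isFull() {
            return size == rows.length;
        }

        void reset() {
            size = 0;
        }
    }

    /** Hypothetical stand-in for the user-supplied Vectorizer. */
    interface Vectorizer<T> {
        void vectorize(T element, RowBatch batch);
    }

    /** Hypothetical stand-in for the writer that a factory would create. */
    static final class BatchingWriter<T> {
        private final Vectorizer<T> vectorizer;
        private final RowBatch batch;
        private final List<Integer> flushedBatchSizes = new ArrayList<>();

        BatchingWriter(Vectorizer<T> vectorizer, int writeBatchSize) {
            this.vectorizer = vectorizer;
            this.batch = new RowBatch(writeBatchSize);
        }

        // Appends one element to the batch and flushes when the batch is full.
        void addElement(T element) {
            vectorizer.vectorize(element, batch);
            if (batch.isFull()) {
                flush();
            }
        }

        // Records the size of the current batch (a real writer would write it out).
        void flush() {
            if (batch.size > 0) {
                flushedBatchSizes.add(batch.size);
                batch.reset();
            }
        }

        List<Integer> flushedBatchSizes() {
            return flushedBatchSizes;
        }
    }

    // Writes all elements and returns the size of every flushed batch.
    static List<Integer> writeAll(List<String> elements, int writeBatchSize) {
        Vectorizer<String> vectorizer = (element, batch) -> batch.rows[batch.size++] = element;
        BatchingWriter<String> writer = new BatchingWriter<>(vectorizer, writeBatchSize);
        for (String element : elements) {
            writer.addElement(element);
        }
        writer.flush(); // flush the final partial batch, as closing a writer would
        return writer.flushedBatchSizes();
    }

    public static void main(String[] args) {
        // Five elements with a batch size of two: prints [2, 2, 1]
        System.out.println(writeAll(List.of("a", "b", "c", "d", "e"), 2));
    }
}
```

Under this assumed policy, a larger writeBatchSize means fewer, larger flushes at the cost of more rows held in memory between writes.
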
Copyright © 2023–2024 The Apache Software Foundation. All rights reserved.