A B C D E F G I J L M N O P R S T U W 

A

AbstractColumnReader<VECTOR extends org.apache.flink.table.data.vector.writable.WritableColumnVector> - Class in org.apache.flink.formats.parquet.vector.reader
Abstract ColumnReader.
AbstractColumnReader(ColumnDescriptor, PageReader) - Constructor for class org.apache.flink.formats.parquet.vector.reader.AbstractColumnReader
 
add(int, Object) - Method in interface org.apache.flink.formats.parquet.utils.ParentDataHolder
 
add(int, Object) - Method in class org.apache.flink.formats.parquet.utils.RowConverter
 
addElement(T) - Method in class org.apache.flink.formats.parquet.ParquetBulkWriter
 
afterReadPage() - Method in class org.apache.flink.formats.parquet.vector.reader.AbstractColumnReader
After reading a page, some initialization may be needed.
afterReadPage() - Method in class org.apache.flink.formats.parquet.vector.reader.BooleanColumnReader
 
applyPredicate(List<Expression>) - Method in class org.apache.flink.formats.parquet.ParquetTableSource
 

B

BooleanColumnReader - Class in org.apache.flink.formats.parquet.vector.reader
Boolean ColumnReader.
BooleanColumnReader(ColumnDescriptor, PageReader) - Constructor for class org.apache.flink.formats.parquet.vector.reader.BooleanColumnReader
 
build() - Method in class org.apache.flink.formats.parquet.ParquetTableSource.Builder
Builds the ParquetTableSource for this builder.
builder() - Static method in class org.apache.flink.formats.parquet.ParquetTableSource
 
Builder() - Constructor for class org.apache.flink.formats.parquet.ParquetTableSource.Builder
 
ByteColumnReader - Class in org.apache.flink.formats.parquet.vector.reader
ByteColumnReader(ColumnDescriptor, PageReader) - Constructor for class org.apache.flink.formats.parquet.vector.reader.ByteColumnReader
 
BytesColumnReader - Class in org.apache.flink.formats.parquet.vector.reader
BytesColumnReader(ColumnDescriptor, PageReader) - Constructor for class org.apache.flink.formats.parquet.vector.reader.BytesColumnReader
 

C

checkTypeName(PrimitiveType.PrimitiveTypeName) - Method in class org.apache.flink.formats.parquet.vector.reader.AbstractColumnReader
 
close() - Method in class org.apache.flink.formats.parquet.ParquetInputFormat
 
close() - Method in class org.apache.flink.formats.parquet.utils.ParquetRecordReader
 
close() - Method in class org.apache.flink.formats.parquet.vector.ParquetColumnarRowSplitReader
 
columnarBatch - Variable in class org.apache.flink.formats.parquet.ParquetVectorizedInputFormat.ParquetReaderBatch
 
ColumnBatchFactory<SplitT extends org.apache.flink.connector.file.src.FileSourceSplit> - Interface in org.apache.flink.formats.parquet.vector
Interface to create VectorizedColumnBatch.
ColumnReader<VECTOR extends org.apache.flink.table.data.vector.writable.WritableColumnVector> - Interface in org.apache.flink.formats.parquet.vector.reader
Reads a batch of records for a column from a Parquet data file into a WritableColumnVector.
computeMinBytesForDecimalPrecision(int) - Static method in class org.apache.flink.formats.parquet.utils.ParquetSchemaConverter
 
conf() - Method in class org.apache.flink.formats.parquet.utils.SerializableConfiguration
 
configure(Configuration) - Method in class org.apache.flink.formats.parquet.ParquetInputFormat
 
convert(Row) - Method in class org.apache.flink.formats.parquet.ParquetAvroInputFormat
 
convert(Row) - Method in class org.apache.flink.formats.parquet.ParquetInputFormat
This ParquetInputFormat reads Parquet records as Row by default.
convert(Row) - Method in class org.apache.flink.formats.parquet.ParquetMapInputFormat
 
convert(Row) - Method in class org.apache.flink.formats.parquet.ParquetPojoInputFormat
 
convert(Row) - Method in class org.apache.flink.formats.parquet.ParquetRowInputFormat
 
convertAndGetIterator(long) - Method in class org.apache.flink.formats.parquet.ParquetVectorizedInputFormat.ParquetReaderBatch
Provides a reading iterator after the records are written to the ParquetVectorizedInputFormat.ParquetReaderBatch.columnarBatch.
convertFields(List<Type>) - Static method in class org.apache.flink.formats.parquet.utils.ParquetSchemaConverter
 
convertParquetTypeToTypeInfo(Type) - Static method in class org.apache.flink.formats.parquet.utils.ParquetSchemaConverter
 
convertToParquetMessageType(String, RowType) - Static method in class org.apache.flink.formats.parquet.utils.ParquetSchemaConverter
 
create(FSDataOutputStream) - Method in class org.apache.flink.formats.parquet.ParquetWriterFactory
 
create(SplitT, ColumnVector[]) - Method in interface org.apache.flink.formats.parquet.vector.ColumnBatchFactory
 
createColumnReader(boolean, LogicalType, ColumnDescriptor, PageReader) - Static method in class org.apache.flink.formats.parquet.vector.ParquetSplitReaderUtil
 
createDecodingFormat(DynamicTableFactory.Context, ReadableConfig) - Method in class org.apache.flink.formats.parquet.ParquetFileFormatFactory
 
createEncodingFormat(DynamicTableFactory.Context, ReadableConfig) - Method in class org.apache.flink.formats.parquet.ParquetFileFormatFactory
 
createPartitionedFormat(Configuration, RowType, List<String>, PartitionFieldExtractor<SplitT>, int, boolean, boolean) - Static method in class org.apache.flink.formats.parquet.ParquetColumnarRowInputFormat
Creates a partitioned ParquetColumnarRowInputFormat; the partition columns can be generated from the Path.
createReader(Configuration, SplitT) - Method in class org.apache.flink.formats.parquet.ParquetVectorizedInputFormat
 
createReaderBatch(WritableColumnVector[], VectorizedColumnBatch, Pool.Recycler<ParquetVectorizedInputFormat.ParquetReaderBatch<RowData>>) - Method in class org.apache.flink.formats.parquet.ParquetColumnarRowInputFormat
 
createReaderBatch(WritableColumnVector[], VectorizedColumnBatch, Pool.Recycler<ParquetVectorizedInputFormat.ParquetReaderBatch<T>>) - Method in class org.apache.flink.formats.parquet.ParquetVectorizedInputFormat
 
createVectorFromConstant(LogicalType, Object, int) - Static method in class org.apache.flink.formats.parquet.vector.ParquetSplitReaderUtil
 
createWritableColumnVector(int, LogicalType, PrimitiveType) - Static method in class org.apache.flink.formats.parquet.vector.ParquetSplitReaderUtil
 
createWriter(OutputFile) - Method in interface org.apache.flink.formats.parquet.ParquetBuilder
Creates and configures a parquet writer to the given output file.
createWriter(OutputFile) - Method in class org.apache.flink.formats.parquet.row.ParquetRowDataBuilder.FlinkParquetBuilder
 
createWriterFactory(RowType, Configuration, boolean) - Static method in class org.apache.flink.formats.parquet.row.ParquetRowDataBuilder
Creates a Parquet BulkWriter.Factory.
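
The entry computeMinBytesForDecimalPrecision(int) above carries no description. As a rough illustration of what a helper with that name typically computes, the sketch below finds the smallest signed byte width that can hold every unscaled value of a given decimal precision. The class and method names are hypothetical, and this is not taken from the Flink source.

```java
import java.math.BigInteger;

// Hypothetical illustration; not the Flink implementation.
final class DecimalBytesSketch {

    // Returns the smallest number of bytes whose signed two's-complement
    // range can hold every unscaled decimal value with the given precision.
    static int minBytesForPrecision(int precision) {
        if (precision < 1) {
            throw new IllegalArgumentException("precision must be >= 1");
        }
        // Largest unscaled magnitude: 10^precision - 1.
        BigInteger maxUnscaled = BigInteger.TEN.pow(precision).subtract(BigInteger.ONE);
        int numBytes = 1;
        // A signed value of n bytes can represent magnitudes up to 2^(8n-1) - 1.
        while (BigInteger.ONE.shiftLeft(8 * numBytes - 1)
                .subtract(BigInteger.ONE)
                .compareTo(maxUnscaled) < 0) {
            numBytes++;
        }
        return numBytes;
    }

    public static void main(String[] args) {
        for (int precision : new int[] {1, 9, 18, 38}) {
            System.out.println(precision + " -> " + minBytesForPrecision(precision));
        }
    }
}
```

Under this calculation, precisions up to 9 fit in 4 bytes and precisions up to 18 fit in 8 bytes, which lines up with the int32 and int64 representations mentioned for ParquetDecimalVector.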

D

decodeInt96ToTimestamp(boolean, Dictionary, int) - Static method in class org.apache.flink.formats.parquet.vector.reader.TimestampColumnReader
 
decodeToBinary(int) - Method in class org.apache.flink.formats.parquet.vector.ParquetDictionary
 
decodeToDouble(int) - Method in class org.apache.flink.formats.parquet.vector.ParquetDictionary
 
decodeToFloat(int) - Method in class org.apache.flink.formats.parquet.vector.ParquetDictionary
 
decodeToInt(int) - Method in class org.apache.flink.formats.parquet.vector.ParquetDictionary
 
decodeToLong(int) - Method in class org.apache.flink.formats.parquet.vector.ParquetDictionary
 
decodeToTimestamp(int) - Method in class org.apache.flink.formats.parquet.vector.ParquetDictionary
 
descriptor - Variable in class org.apache.flink.formats.parquet.vector.reader.AbstractColumnReader
 
dictionary - Variable in class org.apache.flink.formats.parquet.vector.reader.AbstractColumnReader
The dictionary, if this column has dictionary encoding.
DoubleColumnReader - Class in org.apache.flink.formats.parquet.vector.reader
Double ColumnReader.
DoubleColumnReader(ColumnDescriptor, PageReader) - Constructor for class org.apache.flink.formats.parquet.vector.reader.DoubleColumnReader
 

E

end() - Method in class org.apache.flink.formats.parquet.utils.RowConverter
 
explainSource() - Method in class org.apache.flink.formats.parquet.ParquetTableSource
 

F

factoryIdentifier() - Method in class org.apache.flink.formats.parquet.ParquetFileFormatFactory
 
finish() - Method in class org.apache.flink.formats.parquet.ParquetBulkWriter
 
FixedLenBytesColumnReader<VECTOR extends org.apache.flink.table.data.vector.writable.WritableColumnVector> - Class in org.apache.flink.formats.parquet.vector.reader
Fixed-length bytes ColumnReader, used only for decimals.
FixedLenBytesColumnReader(ColumnDescriptor, PageReader, int) - Constructor for class org.apache.flink.formats.parquet.vector.reader.FixedLenBytesColumnReader
 
FlinkParquetBuilder(RowType, Configuration, boolean) - Constructor for class org.apache.flink.formats.parquet.row.ParquetRowDataBuilder.FlinkParquetBuilder
 
FloatColumnReader - Class in org.apache.flink.formats.parquet.vector.reader
FloatColumnReader(ColumnDescriptor, PageReader) - Constructor for class org.apache.flink.formats.parquet.vector.reader.FloatColumnReader
 
flush() - Method in class org.apache.flink.formats.parquet.ParquetBulkWriter
 
forGenericRecord(Schema) - Static method in class org.apache.flink.formats.parquet.avro.ParquetAvroWriters
Creates a ParquetWriterFactory that accepts and writes Avro generic types.
forParquetSchema(MessageType) - Method in class org.apache.flink.formats.parquet.ParquetTableSource.Builder
Sets the Parquet schema of the files to read.
forReflectRecord(Class<T>) - Static method in class org.apache.flink.formats.parquet.avro.ParquetAvroWriters
Creates a ParquetWriterFactory for the given type.
forSpecificRecord(Class<T>) - Static method in class org.apache.flink.formats.parquet.avro.ParquetAvroWriters
Creates a ParquetWriterFactory for an Avro specific type.
forType(Class<T>) - Static method in class org.apache.flink.formats.parquet.protobuf.ParquetProtoWriters
Creates a ParquetWriterFactory for the given type.
fromParquetType(MessageType) - Static method in class org.apache.flink.formats.parquet.utils.ParquetSchemaConverter
Converts Parquet schema to Flink Internal Type.

G

generate(ColumnVector[]) - Method in interface org.apache.flink.formats.parquet.vector.ParquetColumnarRowSplitReader.ColumnBatchGenerator
 
genPartColumnarRowReader(boolean, boolean, Configuration, String[], DataType[], Map<String, Object>, int[], int, Path, long, long) - Static method in class org.apache.flink.formats.parquet.vector.ParquetSplitReaderUtil
Util for generating partitioned ParquetColumnarRowSplitReader.
getAvroSchema() - Method in class org.apache.flink.formats.parquet.ParquetAvroInputFormat
 
getConverter(int) - Method in class org.apache.flink.formats.parquet.utils.RowConverter
 
getCurrentReadPosition() - Method in class org.apache.flink.formats.parquet.utils.ParquetRecordReader
Returns the current read position in the split, i.e., the current block and the number of records that were returned from that block.
getCurrentRecord() - Method in class org.apache.flink.formats.parquet.utils.RowMaterializer
 
getCurrentRow() - Method in class org.apache.flink.formats.parquet.utils.RowConverter
 
getCurrentState() - Method in class org.apache.flink.formats.parquet.ParquetInputFormat
 
getDataSet(ExecutionEnvironment) - Method in class org.apache.flink.formats.parquet.ParquetTableSource
 
getDecimal(int, int, int) - Method in class org.apache.flink.formats.parquet.vector.ParquetDecimalVector
 
getFieldNames() - Method in class org.apache.flink.formats.parquet.ParquetInputFormat
Gets the field names of the read result.
getFieldTypes() - Method in class org.apache.flink.formats.parquet.ParquetInputFormat
Gets the field types of the read result.
getPredicate() - Method in class org.apache.flink.formats.parquet.ParquetInputFormat
 
getProducedType() - Method in class org.apache.flink.formats.parquet.ParquetAvroInputFormat
 
getProducedType() - Method in class org.apache.flink.formats.parquet.ParquetColumnarRowInputFormat
 
getProducedType() - Method in class org.apache.flink.formats.parquet.ParquetRowInputFormat
 
getReturnType() - Method in class org.apache.flink.formats.parquet.ParquetTableSource
 
getRootConverter() - Method in class org.apache.flink.formats.parquet.utils.RowMaterializer
 
getTableSchema() - Method in class org.apache.flink.formats.parquet.ParquetTableSource
 
getTimestampMillis(Binary) - Static method in class org.apache.flink.formats.parquet.utils.ParquetTimestampUtils
Returns GMT timestamp from binary encoded parquet timestamp (12 bytes - julian date + time of day nanos).
getWriteSupport(Configuration) - Method in class org.apache.flink.formats.parquet.row.ParquetRowDataBuilder
 

I

IDENTIFIER - Static variable in class org.apache.flink.formats.parquet.ParquetFileFormatFactory
 
init(InitContext) - Method in class org.apache.flink.formats.parquet.utils.RowReadSupport
 
initialize(ParquetFileReader, Configuration) - Method in class org.apache.flink.formats.parquet.utils.ParquetRecordReader
 
IntColumnReader - Class in org.apache.flink.formats.parquet.vector.reader
IntColumnReader(ColumnDescriptor, PageReader) - Constructor for class org.apache.flink.formats.parquet.vector.reader.IntColumnReader
 
isFilterPushedDown() - Method in class org.apache.flink.formats.parquet.ParquetTableSource
 
isNullAt(int) - Method in class org.apache.flink.formats.parquet.vector.ParquetDecimalVector
 
isSplittable() - Method in class org.apache.flink.formats.parquet.ParquetVectorizedInputFormat
 

J

JULIAN_EPOCH_OFFSET_DAYS - Static variable in class org.apache.flink.formats.parquet.vector.reader.TimestampColumnReader
 

L

LIST_ARRAY_TYPE - Static variable in class org.apache.flink.formats.parquet.utils.ParquetSchemaConverter
 
LIST_ELEMENT - Static variable in class org.apache.flink.formats.parquet.utils.ParquetSchemaConverter
 
LIST_GROUP_NAME - Static variable in class org.apache.flink.formats.parquet.utils.ParquetSchemaConverter
 
LongColumnReader - Class in org.apache.flink.formats.parquet.vector.reader
LongColumnReader(ColumnDescriptor, PageReader) - Constructor for class org.apache.flink.formats.parquet.vector.reader.LongColumnReader
 

M

MAP_VALUE - Static variable in class org.apache.flink.formats.parquet.utils.ParquetSchemaConverter
 
maxDefLevel - Variable in class org.apache.flink.formats.parquet.vector.reader.AbstractColumnReader
Maximum definition level for this column.
MESSAGE_ROOT - Static variable in class org.apache.flink.formats.parquet.utils.ParquetSchemaConverter
 
MILLIS_IN_DAY - Static variable in class org.apache.flink.formats.parquet.vector.reader.TimestampColumnReader
 

N

NANOS_PER_MILLISECOND - Static variable in class org.apache.flink.formats.parquet.vector.reader.TimestampColumnReader
 
NANOS_PER_SECOND - Static variable in class org.apache.flink.formats.parquet.vector.reader.TimestampColumnReader
 
nextRecord(E) - Method in class org.apache.flink.formats.parquet.ParquetInputFormat
 
nextRecord() - Method in class org.apache.flink.formats.parquet.utils.ParquetRecordReader
Returns the next record.
nextRecord() - Method in class org.apache.flink.formats.parquet.vector.ParquetColumnarRowSplitReader
 
numBatchesToCirculate(Configuration) - Method in class org.apache.flink.formats.parquet.ParquetColumnarRowInputFormat
 
numBatchesToCirculate(Configuration) - Method in class org.apache.flink.formats.parquet.ParquetVectorizedInputFormat
 

O

open(FileInputSplit) - Method in class org.apache.flink.formats.parquet.ParquetInputFormat
 
open(FileInputSplit) - Method in class org.apache.flink.formats.parquet.ParquetPojoInputFormat
 
optionalOptions() - Method in class org.apache.flink.formats.parquet.ParquetFileFormatFactory
 
org.apache.flink.formats.parquet - package org.apache.flink.formats.parquet
 
org.apache.flink.formats.parquet.avro - package org.apache.flink.formats.parquet.avro
 
org.apache.flink.formats.parquet.protobuf - package org.apache.flink.formats.parquet.protobuf
 
org.apache.flink.formats.parquet.row - package org.apache.flink.formats.parquet.row
 
org.apache.flink.formats.parquet.utils - package org.apache.flink.formats.parquet.utils
 
org.apache.flink.formats.parquet.vector - package org.apache.flink.formats.parquet.vector
 
org.apache.flink.formats.parquet.vector.reader - package org.apache.flink.formats.parquet.vector.reader
 

P

ParentDataHolder - Interface in org.apache.flink.formats.parquet.utils
Interface used by RowConverter to extract nested values from a Parquet record.
PARQUET_SKIP_CORRUPTED_RECORD - Static variable in class org.apache.flink.formats.parquet.ParquetInputFormat
The config parameter that defines whether to skip corrupted records.
PARQUET_SKIP_WRONG_SCHEMA_SPLITS - Static variable in class org.apache.flink.formats.parquet.ParquetInputFormat
The config parameter that defines whether to skip file splits with a wrong schema.
ParquetAvroInputFormat - Class in org.apache.flink.formats.parquet
An implementation of ParquetInputFormat to read records from Parquet files and convert them to Avro GenericRecord.
ParquetAvroInputFormat(Path, MessageType) - Constructor for class org.apache.flink.formats.parquet.ParquetAvroInputFormat
 
ParquetAvroWriters - Class in org.apache.flink.formats.parquet.avro
Convenience builder to create ParquetWriterFactory instances for the different Avro types.
ParquetBuilder<T> - Interface in org.apache.flink.formats.parquet
A builder to create a ParquetWriter from a Parquet OutputFile.
ParquetBulkWriter<T> - Class in org.apache.flink.formats.parquet
A simple BulkWriter implementation that wraps a ParquetWriter.
ParquetBulkWriter(ParquetWriter<T>) - Constructor for class org.apache.flink.formats.parquet.ParquetBulkWriter
Creates a new ParquetBulkWriter wrapping the given ParquetWriter.
ParquetColumnarRowInputFormat<SplitT extends org.apache.flink.connector.file.src.FileSourceSplit> - Class in org.apache.flink.formats.parquet
A ParquetVectorizedInputFormat that provides a RowData iterator.
ParquetColumnarRowInputFormat(Configuration, RowType, int, boolean, boolean) - Constructor for class org.apache.flink.formats.parquet.ParquetColumnarRowInputFormat
Constructor to create parquet format without extra fields.
ParquetColumnarRowInputFormat(Configuration, RowType, RowType, ColumnBatchFactory<SplitT>, int, boolean, boolean) - Constructor for class org.apache.flink.formats.parquet.ParquetColumnarRowInputFormat
Constructor to create parquet format with extra fields created by ColumnBatchFactory.
ParquetColumnarRowSplitReader - Class in org.apache.flink.formats.parquet.vector
This reader is used to read a VectorizedColumnBatch from input split.
ParquetColumnarRowSplitReader(boolean, boolean, Configuration, LogicalType[], String[], ParquetColumnarRowSplitReader.ColumnBatchGenerator, int, Path, long, long) - Constructor for class org.apache.flink.formats.parquet.vector.ParquetColumnarRowSplitReader
 
ParquetColumnarRowSplitReader.ColumnBatchGenerator - Interface in org.apache.flink.formats.parquet.vector
Interface to generate a VectorizedColumnBatch.
ParquetDecimalVector - Class in org.apache.flink.formats.parquet.vector
Parquet writes decimals as int32, int64, or binary; this class wraps the real vector to provide the DecimalColumnVector interface.
ParquetDecimalVector(ColumnVector) - Constructor for class org.apache.flink.formats.parquet.vector.ParquetDecimalVector
 
ParquetDictionary - Class in org.apache.flink.formats.parquet.vector
Parquet dictionary.
ParquetDictionary(Dictionary) - Constructor for class org.apache.flink.formats.parquet.vector.ParquetDictionary
 
ParquetFileFormatFactory - Class in org.apache.flink.formats.parquet
Parquet format factory for file system.
ParquetFileFormatFactory() - Constructor for class org.apache.flink.formats.parquet.ParquetFileFormatFactory
 
ParquetInputFormat<E> - Class in org.apache.flink.formats.parquet
The base InputFormat class to read from Parquet files.
ParquetInputFormat(Path, MessageType) - Constructor for class org.apache.flink.formats.parquet.ParquetInputFormat
Read parquet files with given parquet file schema.
ParquetMapInputFormat - Class in org.apache.flink.formats.parquet
An implementation of ParquetInputFormat to read Map records from Parquet files.
ParquetMapInputFormat(Path, MessageType) - Constructor for class org.apache.flink.formats.parquet.ParquetMapInputFormat
 
ParquetMapInputFormat(Path) - Constructor for class org.apache.flink.formats.parquet.ParquetMapInputFormat
 
ParquetPojoInputFormat<E> - Class in org.apache.flink.formats.parquet
An implementation of ParquetInputFormat to read POJO records from Parquet files.
ParquetPojoInputFormat(Path, MessageType, PojoTypeInfo<E>) - Constructor for class org.apache.flink.formats.parquet.ParquetPojoInputFormat
 
ParquetPojoInputFormat(Path, PojoTypeInfo<E>) - Constructor for class org.apache.flink.formats.parquet.ParquetPojoInputFormat
 
ParquetProtoWriters - Class in org.apache.flink.formats.parquet.protobuf
Convenience builder for creating ParquetWriterFactory instances for Protobuf classes.
ParquetReaderBatch(WritableColumnVector[], VectorizedColumnBatch, Pool.Recycler<ParquetVectorizedInputFormat.ParquetReaderBatch<T>>) - Constructor for class org.apache.flink.formats.parquet.ParquetVectorizedInputFormat.ParquetReaderBatch
 
ParquetRecordReader<T> - Class in org.apache.flink.formats.parquet.utils
Customized ParquetRecordReader that supports starting to read from a particular position.
ParquetRecordReader(ReadSupport<T>, MessageType, FilterCompat.Filter) - Constructor for class org.apache.flink.formats.parquet.utils.ParquetRecordReader
 
ParquetRecordReader(ReadSupport<T>, MessageType) - Constructor for class org.apache.flink.formats.parquet.utils.ParquetRecordReader
 
ParquetRowDataBuilder - Class in org.apache.flink.formats.parquet.row
A ParquetWriter.Builder for RowData.
ParquetRowDataBuilder(OutputFile, RowType, boolean) - Constructor for class org.apache.flink.formats.parquet.row.ParquetRowDataBuilder
 
ParquetRowDataBuilder.FlinkParquetBuilder - Class in org.apache.flink.formats.parquet.row
Flink Row ParquetBuilder.
ParquetRowDataWriter - Class in org.apache.flink.formats.parquet.row
Writes a record to the Parquet API with the expected schema in order to be written to a file.
ParquetRowDataWriter(RecordConsumer, RowType, GroupType, boolean) - Constructor for class org.apache.flink.formats.parquet.row.ParquetRowDataWriter
 
ParquetRowInputFormat - Class in org.apache.flink.formats.parquet
An implementation of ParquetInputFormat to read Row records from Parquet files.
ParquetRowInputFormat(Path, MessageType) - Constructor for class org.apache.flink.formats.parquet.ParquetRowInputFormat
 
ParquetRowInputFormat(Path) - Constructor for class org.apache.flink.formats.parquet.ParquetRowInputFormat
 
ParquetSchemaConverter - Class in org.apache.flink.formats.parquet.utils
Schema converter converts Parquet schema to and from Flink internal types.
ParquetSchemaConverter() - Constructor for class org.apache.flink.formats.parquet.utils.ParquetSchemaConverter
 
ParquetSplitReaderUtil - Class in org.apache.flink.formats.parquet.vector
Util for generating ParquetColumnarRowSplitReader.
ParquetSplitReaderUtil() - Constructor for class org.apache.flink.formats.parquet.vector.ParquetSplitReaderUtil
 
ParquetTableSource - Class in org.apache.flink.formats.parquet
A TableSource to read Parquet files.
ParquetTableSource.Builder - Class in org.apache.flink.formats.parquet
Constructs a ParquetTableSource.
ParquetTimestampUtils - Class in org.apache.flink.formats.parquet.utils
Utility class for decoding INT96 encoded parquet timestamp to timestamp millis in GMT.
ParquetVectorizedInputFormat<T,SplitT extends org.apache.flink.connector.file.src.FileSourceSplit> - Class in org.apache.flink.formats.parquet
Parquet BulkFormat that reads data from the file to VectorizedColumnBatch in vectorized mode.
ParquetVectorizedInputFormat(SerializableConfiguration, RowType, ColumnBatchFactory<SplitT>, int, boolean, boolean) - Constructor for class org.apache.flink.formats.parquet.ParquetVectorizedInputFormat
 
ParquetVectorizedInputFormat.ParquetReaderBatch<T> - Class in org.apache.flink.formats.parquet
Reader batch that provides writing and reading capabilities.
ParquetWriterFactory<T> - Class in org.apache.flink.formats.parquet
A factory that creates a Parquet BulkWriter.
ParquetWriterFactory(ParquetBuilder<T>) - Constructor for class org.apache.flink.formats.parquet.ParquetWriterFactory
Creates a new ParquetWriterFactory using the given builder to assemble the ParquetWriter.
path(String) - Method in class org.apache.flink.formats.parquet.ParquetTableSource.Builder
Sets the path of Parquet files.
path(String, boolean) - Method in class org.apache.flink.formats.parquet.ParquetTableSource.Builder
Sets the path of the Parquet files.
prepareForRead(Configuration, Map<String, String>, MessageType, ReadSupport.ReadContext) - Method in class org.apache.flink.formats.parquet.utils.RowReadSupport
 
projectFields(int[]) - Method in class org.apache.flink.formats.parquet.ParquetTableSource
 

R

reachedEnd() - Method in class org.apache.flink.formats.parquet.ParquetInputFormat
 
reachedEnd() - Method in class org.apache.flink.formats.parquet.vector.ParquetColumnarRowSplitReader
Method used to check if the end of the input is reached.
reachEnd() - Method in class org.apache.flink.formats.parquet.utils.ParquetRecordReader
Checks if the record reader returned all records.
readBatch(int, int, VECTOR) - Method in class org.apache.flink.formats.parquet.vector.reader.AbstractColumnReader
readBatch(int, int, WritableBooleanVector) - Method in class org.apache.flink.formats.parquet.vector.reader.BooleanColumnReader
 
readBatch(int, int, WritableByteVector) - Method in class org.apache.flink.formats.parquet.vector.reader.ByteColumnReader
 
readBatch(int, int, WritableBytesVector) - Method in class org.apache.flink.formats.parquet.vector.reader.BytesColumnReader
 
readBatch(int, int, WritableDoubleVector) - Method in class org.apache.flink.formats.parquet.vector.reader.DoubleColumnReader
 
readBatch(int, int, VECTOR) - Method in class org.apache.flink.formats.parquet.vector.reader.FixedLenBytesColumnReader
 
readBatch(int, int, WritableFloatVector) - Method in class org.apache.flink.formats.parquet.vector.reader.FloatColumnReader
 
readBatch(int, int, WritableIntVector) - Method in class org.apache.flink.formats.parquet.vector.reader.IntColumnReader
 
readBatch(int, int, WritableLongVector) - Method in class org.apache.flink.formats.parquet.vector.reader.LongColumnReader
 
readBatch(int, int, WritableShortVector) - Method in class org.apache.flink.formats.parquet.vector.reader.ShortColumnReader
 
readBatch(int, int, WritableTimestampVector) - Method in class org.apache.flink.formats.parquet.vector.reader.TimestampColumnReader
 
readBatchFromDictionaryIds(int, int, VECTOR, WritableIntVector) - Method in class org.apache.flink.formats.parquet.vector.reader.AbstractColumnReader
Decode dictionary ids to data.
readBatchFromDictionaryIds(int, int, WritableBooleanVector, WritableIntVector) - Method in class org.apache.flink.formats.parquet.vector.reader.BooleanColumnReader
 
readBatchFromDictionaryIds(int, int, WritableByteVector, WritableIntVector) - Method in class org.apache.flink.formats.parquet.vector.reader.ByteColumnReader
 
readBatchFromDictionaryIds(int, int, WritableBytesVector, WritableIntVector) - Method in class org.apache.flink.formats.parquet.vector.reader.BytesColumnReader
 
readBatchFromDictionaryIds(int, int, WritableDoubleVector, WritableIntVector) - Method in class org.apache.flink.formats.parquet.vector.reader.DoubleColumnReader
 
readBatchFromDictionaryIds(int, int, VECTOR, WritableIntVector) - Method in class org.apache.flink.formats.parquet.vector.reader.FixedLenBytesColumnReader
 
readBatchFromDictionaryIds(int, int, WritableFloatVector, WritableIntVector) - Method in class org.apache.flink.formats.parquet.vector.reader.FloatColumnReader
 
readBatchFromDictionaryIds(int, int, WritableIntVector, WritableIntVector) - Method in class org.apache.flink.formats.parquet.vector.reader.IntColumnReader
 
readBatchFromDictionaryIds(int, int, WritableLongVector, WritableIntVector) - Method in class org.apache.flink.formats.parquet.vector.reader.LongColumnReader
 
readBatchFromDictionaryIds(int, int, WritableShortVector, WritableIntVector) - Method in class org.apache.flink.formats.parquet.vector.reader.ShortColumnReader
 
readBatchFromDictionaryIds(int, int, WritableTimestampVector, WritableIntVector) - Method in class org.apache.flink.formats.parquet.vector.reader.TimestampColumnReader
 
readToVector(int, VECTOR) - Method in class org.apache.flink.formats.parquet.vector.reader.AbstractColumnReader
Reads `total` values from this columnReader into column.
readToVector(int, VECTOR) - Method in interface org.apache.flink.formats.parquet.vector.reader.ColumnReader
 
recycle() - Method in class org.apache.flink.formats.parquet.ParquetVectorizedInputFormat.ParquetReaderBatch
 
reopen(FileInputSplit, Tuple2<Long, Long>) - Method in class org.apache.flink.formats.parquet.ParquetInputFormat
 
requiredOptions() - Method in class org.apache.flink.formats.parquet.ParquetFileFormatFactory
 
restoreReader(Configuration, SplitT) - Method in class org.apache.flink.formats.parquet.ParquetVectorizedInputFormat
 
RowConverter - Class in org.apache.flink.formats.parquet.utils
Extends GroupConverter to convert a nested Parquet record into a Row.
RowConverter(MessageType, TypeInformation<?>) - Constructor for class org.apache.flink.formats.parquet.utils.RowConverter
 
RowConverter(GroupType, TypeInformation<?>, ParentDataHolder, int) - Constructor for class org.apache.flink.formats.parquet.utils.RowConverter
 
RowMaterializer - Class in org.apache.flink.formats.parquet.utils
Row materializer for RowReadSupport.
RowMaterializer(MessageType, TypeInformation<?>) - Constructor for class org.apache.flink.formats.parquet.utils.RowMaterializer
 
RowReadSupport - Class in org.apache.flink.formats.parquet.utils
A Parquet ReadSupport implementation for reading Parquet record as Row.
RowReadSupport() - Constructor for class org.apache.flink.formats.parquet.utils.RowReadSupport
 
runLenDecoder - Variable in class org.apache.flink.formats.parquet.vector.reader.AbstractColumnReader
Run length decoder for data and dictionary.
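
The readBatchFromDictionaryIds entries above decode dictionary ids to data, and runLenDecoder is described as a run length decoder for data and dictionary. The sketch below shows the general idea in isolation: runs of repeated ids are expanded, then each id is replaced by the dictionary value at that index. It is a simplified illustration with hypothetical names; the real Parquet encoding is an RLE/bit-packing hybrid, and this code does not use the Flink or Parquet classes.

```java
import java.util.Arrays;

// Hypothetical illustration of run-length expansion plus dictionary lookup.
final class DictionaryDecodeSketch {

    // Expands {runLength, id} pairs into a flat array of dictionary ids.
    static int[] expandRuns(int[][] runs) {
        int total = 0;
        for (int[] run : runs) {
            total += run[0];
        }
        int[] ids = new int[total];
        int pos = 0;
        for (int[] run : runs) {
            Arrays.fill(ids, pos, pos + run[0], run[1]);
            pos += run[0];
        }
        return ids;
    }

    // Replaces each dictionary id with the value stored at that index.
    static long[] decodeIds(int[] ids, long[] dictionary) {
        long[] values = new long[ids.length];
        for (int i = 0; i < ids.length; i++) {
            values[i] = dictionary[ids[i]];
        }
        return values;
    }

    public static void main(String[] args) {
        long[] dictionary = {10L, 20L};
        // Three copies of id 1, then two copies of id 0.
        int[] ids = expandRuns(new int[][] {{3, 1}, {2, 0}});
        System.out.println(Arrays.toString(decodeIds(ids, dictionary)));
    }
}
```

Keeping the ids and decoding them later is what makes lazy decoding possible: the id vector can be carried along and only resolved against the dictionary when values are actually needed.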

S

seek(long, long) - Method in class org.apache.flink.formats.parquet.utils.ParquetRecordReader
Moves the reading position to the given block and seeks to and reads the given record.
seekToRow(long) - Method in class org.apache.flink.formats.parquet.vector.ParquetColumnarRowSplitReader
Seek to a particular row number.
selectFields(String[]) - Method in class org.apache.flink.formats.parquet.ParquetAvroInputFormat
 
selectFields(String[]) - Method in class org.apache.flink.formats.parquet.ParquetInputFormat
Configures the fields to be read and returned by the ParquetInputFormat.
self() - Method in class org.apache.flink.formats.parquet.row.ParquetRowDataBuilder
 
SerializableConfiguration - Class in org.apache.flink.formats.parquet.utils
Wraps Configuration in a serializable class.
SerializableConfiguration(Configuration) - Constructor for class org.apache.flink.formats.parquet.utils.SerializableConfiguration
 
setFilterPredicate(FilterPredicate) - Method in class org.apache.flink.formats.parquet.ParquetInputFormat
 
setSkipCorruptedRecord(boolean) - Method in class org.apache.flink.formats.parquet.utils.ParquetRecordReader
 
ShortColumnReader - Class in org.apache.flink.formats.parquet.vector.reader
ShortColumnReader(ColumnDescriptor, PageReader) - Constructor for class org.apache.flink.formats.parquet.vector.reader.ShortColumnReader
 
start() - Method in class org.apache.flink.formats.parquet.utils.RowConverter
 
supportLazyDecode() - Method in class org.apache.flink.formats.parquet.vector.reader.AbstractColumnReader
Whether lazy decoding of dictionary ids is supported.
supportLazyDecode() - Method in class org.apache.flink.formats.parquet.vector.reader.BooleanColumnReader
 
supportLazyDecode() - Method in class org.apache.flink.formats.parquet.vector.reader.TimestampColumnReader
 

T

TimestampColumnReader - Class in org.apache.flink.formats.parquet.vector.reader
Timestamp ColumnReader.
TimestampColumnReader(boolean, ColumnDescriptor, PageReader) - Constructor for class org.apache.flink.formats.parquet.vector.reader.TimestampColumnReader
 
toParquetType(TypeInformation<?>, boolean) - Static method in class org.apache.flink.formats.parquet.utils.ParquetSchemaConverter
Converts Flink Internal Type to Parquet schema.
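
TimestampColumnReader and ParquetTimestampUtils.getTimestampMillis both deal with the INT96 timestamp layout, described in this index as 12 bytes holding a Julian date plus time-of-day nanos. The sketch below decodes that layout to epoch milliseconds using only the JDK. It assumes the conventional little-endian ordering with the 8-byte nanos field first and the 4-byte Julian day second. The constant names mirror those listed for TimestampColumnReader, but their values here are the standard ones and are not copied from the Flink source; the class and the encode helper are hypothetical.

```java
import java.nio.ByteBuffer;
import java.nio.ByteOrder;

// Hypothetical illustration; not the Flink implementation.
final class Int96TimestampSketch {
    // Julian day number of 1970-01-01.
    static final long JULIAN_EPOCH_OFFSET_DAYS = 2_440_588L;
    static final long MILLIS_IN_DAY = 86_400_000L;
    static final long NANOS_PER_MILLISECOND = 1_000_000L;

    // Decodes a 12-byte INT96 timestamp to milliseconds since the Unix epoch.
    static long toEpochMillis(byte[] int96) {
        if (int96.length != 12) {
            throw new IllegalArgumentException("INT96 timestamp must be 12 bytes");
        }
        ByteBuffer buf = ByteBuffer.wrap(int96).order(ByteOrder.LITTLE_ENDIAN);
        long timeOfDayNanos = buf.getLong(); // first 8 bytes
        int julianDay = buf.getInt();        // last 4 bytes
        return (julianDay - JULIAN_EPOCH_OFFSET_DAYS) * MILLIS_IN_DAY
                + timeOfDayNanos / NANOS_PER_MILLISECOND;
    }

    // Helper for building sample input in the same assumed layout.
    static byte[] encode(long timeOfDayNanos, int julianDay) {
        return ByteBuffer.allocate(12)
                .order(ByteOrder.LITTLE_ENDIAN)
                .putLong(timeOfDayNanos)
                .putInt(julianDay)
                .array();
    }

    public static void main(String[] args) {
        // One day and one millisecond after the Unix epoch.
        byte[] sample = encode(1_000_000L, 2_440_589);
        System.out.println(toEpochMillis(sample));
    }
}
```

Sub-millisecond nanos are truncated by the integer division, so this sketch only recovers millisecond precision.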

U

UTC_TIMEZONE - Static variable in class org.apache.flink.formats.parquet.ParquetFileFormatFactory
 

W

withConfiguration(Configuration) - Method in class org.apache.flink.formats.parquet.ParquetTableSource.Builder
Sets a Hadoop Configuration for the Parquet Reader.
withoutExtraFields() - Static method in interface org.apache.flink.formats.parquet.vector.ColumnBatchFactory
 
write(RowData) - Method in class org.apache.flink.formats.parquet.row.ParquetRowDataWriter
Writes a record to Parquet.

Copyright © 2014–2022 The Apache Software Foundation. All rights reserved.