ColumnMetaData
Description for column metadata.
Properties
- $_TSPEC : mixed
- $bloom_filter_length : int
- Size of Bloom filter data including the serialized header, in bytes.
- $bloom_filter_offset : int
- Byte offset from beginning of file to Bloom filter data. *.
- $codec : int
- Compression codec *.
- $data_page_offset : int
- Byte offset from beginning of file to first data page *.
- $dictionary_page_offset : int
- Byte offset from the beginning of file to first (only) dictionary page *.
- $encoding_stats : array<string|int, PageEncodingStats>
- Set of all encodings used for pages in this column chunk.
- $encodings : array<string|int, int>
- Set of all encodings used for this column. The purpose is to validate whether we can decode those pages. *.
- $index_page_offset : int
- Byte offset from beginning of file to root index page *.
- $isValidate : mixed
- $key_value_metadata : array<string|int, KeyValue>
- Optional key/value metadata *.
- $num_values : int
- Number of values in this column *.
- $path_in_schema : array<string|int, string>
- Path in schema *.
- $size_statistics : SizeStatistics
- Optional statistics to help estimate total memory when converted to in-memory representations. The histograms contained in these statistics can also be useful in some cases for more fine-grained nullability/list length filter pushdown.
- $statistics : Statistics
- optional statistics for this column chunk.
- $total_compressed_size : int
- total byte size of all compressed, and potentially encrypted, pages in this column chunk (including the headers) *.
- $total_uncompressed_size : int
- total byte size of all uncompressed pages in this column chunk (including the headers) *.
- $type : int
- Type of this column *.
Methods
- __construct() : mixed
- getName() : mixed
- read() : mixed
- write() : mixed
Properties
$_TSPEC
public
static mixed
$_TSPEC
= [1 => ['var' => 'type', 'isRequired' => true, 'type' => \Thrift\Type\TType::I32, 'class' => '\Flow\Parquet\Thrift\Type'], 2 => ['var' => 'encodings', 'isRequired' => true, 'type' => \Thrift\Type\TType::LST, 'etype' => \Thrift\Type\TType::I32, 'elem' => ['type' => \Thrift\Type\TType::I32, 'class' => '\Flow\Parquet\Thrift\Encoding']], 3 => ['var' => 'path_in_schema', 'isRequired' => true, 'type' => \Thrift\Type\TType::LST, 'etype' => \Thrift\Type\TType::STRING, 'elem' => ['type' => \Thrift\Type\TType::STRING]], 4 => ['var' => 'codec', 'isRequired' => true, 'type' => \Thrift\Type\TType::I32, 'class' => '\Flow\Parquet\Thrift\CompressionCodec'], 5 => ['var' => 'num_values', 'isRequired' => true, 'type' => \Thrift\Type\TType::I64], 6 => ['var' => 'total_uncompressed_size', 'isRequired' => true, 'type' => \Thrift\Type\TType::I64], 7 => ['var' => 'total_compressed_size', 'isRequired' => true, 'type' => \Thrift\Type\TType::I64], 8 => ['var' => 'key_value_metadata', 'isRequired' => false, 'type' => \Thrift\Type\TType::LST, 'etype' => \Thrift\Type\TType::STRUCT, 'elem' => ['type' => \Thrift\Type\TType::STRUCT, 'class' => '\Flow\Parquet\Thrift\KeyValue']], 9 => ['var' => 'data_page_offset', 'isRequired' => true, 'type' => \Thrift\Type\TType::I64], 10 => ['var' => 'index_page_offset', 'isRequired' => false, 'type' => \Thrift\Type\TType::I64], 11 => ['var' => 'dictionary_page_offset', 'isRequired' => false, 'type' => \Thrift\Type\TType::I64], 12 => ['var' => 'statistics', 'isRequired' => false, 'type' => \Thrift\Type\TType::STRUCT, 'class' => '\Flow\Parquet\Thrift\Statistics'], 13 => ['var' => 'encoding_stats', 'isRequired' => false, 'type' => \Thrift\Type\TType::LST, 'etype' => \Thrift\Type\TType::STRUCT, 'elem' => ['type' => \Thrift\Type\TType::STRUCT, 'class' => '\Flow\Parquet\Thrift\PageEncodingStats']], 14 => ['var' => 'bloom_filter_offset', 'isRequired' => false, 'type' => \Thrift\Type\TType::I64], 15 => ['var' => 'bloom_filter_length', 'isRequired' => false, 'type' => \Thrift\Type\TType::I32], 16 => ['var' => 'size_statistics', 'isRequired' => false, 'type' => \Thrift\Type\TType::STRUCT, 'class' => '\Flow\Parquet\Thrift\SizeStatistics']]
$bloom_filter_length
Size of Bloom filter data including the serialized header, in bytes.
public
int
$bloom_filter_length
Added in 2.10 so readers may not read this field from old files and it can be obtained after the BloomFilterHeader has been deserialized. Writers should write this field so readers can read the bloom filter in a single I/O.
$bloom_filter_offset
Byte offset from beginning of file to Bloom filter data. *.
public
int
$bloom_filter_offset
$codec
Compression codec *.
public
int
$codec
$data_page_offset
Byte offset from beginning of file to first data page *.
public
int
$data_page_offset
$dictionary_page_offset
Byte offset from the beginning of file to first (only) dictionary page *.
public
int
$dictionary_page_offset
$encoding_stats
Set of all encodings used for pages in this column chunk.
public
array<string|int, PageEncodingStats>
$encoding_stats
This information can be used to determine if all data pages are dictionary encoded for example *.
$encodings
Set of all encodings used for this column. The purpose is to validate whether we can decode those pages. *.
public
array<string|int, int>
$encodings
$index_page_offset
Byte offset from beginning of file to root index page *.
public
int
$index_page_offset
$isValidate
public
static mixed
$isValidate
= false
$key_value_metadata
Optional key/value metadata *.
public
array<string|int, KeyValue>
$key_value_metadata
$num_values
Number of values in this column *.
public
int
$num_values
$path_in_schema
Path in schema *.
public
array<string|int, string>
$path_in_schema
$size_statistics
Optional statistics to help estimate total memory when converted to in-memory representations. The histograms contained in these statistics can also be useful in some cases for more fine-grained nullability/list length filter pushdown.
public
SizeStatistics
$size_statistics
$statistics
optional statistics for this column chunk.
public
Statistics
$statistics
$total_compressed_size
total byte size of all compressed, and potentially encrypted, pages in this column chunk (including the headers) *.
public
int
$total_compressed_size
$total_uncompressed_size
total byte size of all uncompressed pages in this column chunk (including the headers) *.
public
int
$total_uncompressed_size
$type
Type of this column *.
public
int
$type
Methods
__construct()
public
__construct([mixed $vals = null ]) : mixed
Parameters
- $vals : mixed = null
getName()
public
getName() : mixed
read()
public
read(mixed $input) : mixed
Parameters
- $input : mixed
write()
public
write(mixed $output) : mixed
Parameters
- $output : mixed