Apache Parquet Writer Parameters
Compression Type
The compression type to apply to each column when writing the file. Use this option to reduce file size.
- UNCOMPRESSED – This is the default.
- ZSTD – This option provides a good compression ratio across diverse datasets. ZSTD works well with CHAR and VARCHAR columns that store a wide range of long and short strings, including JSON strings.
- Snappy – This option provides a reasonable compression ratio with fast compression speeds.
File Version
Determines which Parquet format version the writer targets, and therefore which types it may use. Some external Parquet implementations may not support types added in the 2.0.0 format version. By default, files are written as version 2.0. To maximize compatibility of your files, change this parameter to 1.0.