Apache Parquet
Apache Parquet is a free and open-source column-oriented data storage format in the Apache Hadoop ecosystem. It is similar to RCFile and ORC, the other columnar storage file formats in Hadoop, and is compatible with most of the data processing frameworks around Hadoop. It provides efficient data compression and encoding schemes and is designed to perform well when handling complex data in bulk.
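To illustrate what "column-oriented" means in practice, the following is a minimal sketch using only the Python standard library. It is not the Parquet implementation and does not read or write Parquet files. The sample records, the field names, and the helpers `to_columns`, `run_length_encode` and `run_length_decode` are all hypothetical, chosen for illustration. The run-length step is a simplified stand-in for the kind of per-column encoding a columnar format can apply. Parquet's own encodings, such as its RLE/bit-packing hybrid, are more elaborate.

```python
from itertools import groupby

# Hypothetical sample records; the field names are illustrative only.
rows = [
    {"id": 1, "country": "DE", "amount": 10.0},
    {"id": 2, "country": "DE", "amount": 12.5},
    {"id": 3, "country": "DE", "amount": 7.25},
    {"id": 4, "country": "FR", "amount": 3.0},
    {"id": 5, "country": "FR", "amount": 8.0},
]


def to_columns(records):
    """Pivot a list of row dicts into a dict of column lists."""
    if not records:
        return {}
    return {name: [r[name] for r in records] for name in records[0]}


def run_length_encode(values):
    """Collapse consecutive repeats into (value, count) pairs."""
    return [(value, len(list(group))) for value, group in groupby(values)]


def run_length_decode(pairs):
    """Inverse of run_length_encode: expand (value, count) pairs."""
    return [value for value, count in pairs for _ in range(count)]


columns = to_columns(rows)

# Values of one column sit next to each other, so repeats are easy to collapse.
encoded_country = run_length_encode(columns["country"])

# An aggregate over one column reads only that column's values, not whole rows.
total_amount = sum(columns["amount"])
```

In this sketch the five `country` values collapse to two `(value, count)` pairs, and summing `amount` never touches the `id` or `country` values. Those two effects, better compressibility of similar adjacent values and reading only the columns a query needs, are the usual motivation for a columnar layout.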
Initial release | 13 March 2013 |
---|---|
Stable release | 2.9.0 / 6 October 2021 |
Repository | |
Written in | Java (reference implementation) |
Operating system | Cross-platform |
Type | Column-oriented DBMS |
License | Apache License 2.0 |
Website | parquet |
This article is issued from Wikipedia. The text is licensed under Creative Commons - Attribution - Sharealike. Additional terms may apply for the media files.