The Optimized Row Columnar (ORC) file format provides a highly efficient way to store Hive data. It was designed to overcome limitations of the other Hive file formats. Using ORC files improves performance when Hive is reading, writing, and processing data.
Optimized Row Columnar (ORC) file format is based on Hive’s RCFile which was the standard format for storing tabular data in Hadoop for several years. ORC was introduced in Hive 0.11.
References
Related Tags