File types
Parquet
- Efficient column selection (a query reads only the columns it needs)
- Columnar format
- Binary format
- Encoded & Compressed
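
The points above can be illustrated with a toy sketch. This is not Parquet's actual on-disk format or API. It is a minimal standard-library model, assuming hypothetical sample rows, JSON as the serialization, run-length encoding as the encoding step, and zlib as the compressor. All function names (`to_columns`, `run_length_encode`, `encode_column`, `decode_column`) are made up for illustration.

```python
import json
import zlib

# Hypothetical sample rows; the field names are illustrative only.
rows = [
    {"id": 1, "country": "DE", "amount": 10.0},
    {"id": 2, "country": "DE", "amount": 12.5},
    {"id": 3, "country": "FR", "amount": 7.0},
    {"id": 4, "country": "FR", "amount": 3.5},
]


def to_columns(rows):
    """Pivot a list of row dicts into one list of values per column."""
    return {name: [row[name] for row in rows] for name in rows[0]}


def run_length_encode(values):
    """Collapse consecutive repeats into [value, count] pairs."""
    runs = []
    for value in values:
        if runs and runs[-1][0] == value:
            runs[-1][1] += 1
        else:
            runs.append([value, 1])
    return runs


def encode_column(values):
    """Encode, then compress, a single column into a binary chunk."""
    payload = json.dumps(run_length_encode(values)).encode("utf-8")
    return zlib.compress(payload)


def decode_column(chunk):
    """Reverse encode_column: decompress, then expand the runs."""
    runs = json.loads(zlib.decompress(chunk).decode("utf-8"))
    return [value for value, count in runs for _ in range(count)]


# Each column is stored as its own binary chunk, so a reader can decode
# one column without touching the bytes of the others.
stored = {name: encode_column(values) for name, values in to_columns(rows).items()}

countries = decode_column(stored["country"])
print(countries)
```

Only the `country` chunk is decompressed here; the `id` and `amount` chunks stay untouched, which is the idea behind column selection in a columnar file.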
ORC
- Nested Data
- Columnar format
- Predicate pushdown (min/max statistics + Bloom filters)
- ACID support / cannot add
- Suggested for streaming (source)
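
Predicate pushdown with min/max statistics can be sketched roughly as follows. This is a toy model, not ORC's reader API: the `Stripe` class, `build_stripes`, and `scan_equal` are hypothetical names, the data is synthetic, and Bloom filters are left out to keep the example short.

```python
from dataclasses import dataclass


@dataclass
class Stripe:
    """A hypothetical block of values with precomputed statistics."""
    values: list
    minimum: int
    maximum: int


def build_stripes(values, stripe_size):
    """Split values into fixed-size stripes and record min/max for each."""
    stripes = []
    for start in range(0, len(values), stripe_size):
        chunk = values[start:start + stripe_size]
        stripes.append(Stripe(chunk, min(chunk), max(chunk)))
    return stripes


def scan_equal(stripes, target):
    """Return the matching values and how many stripes were actually read."""
    matches = []
    stripes_read = 0
    for stripe in stripes:
        # Skip the stripe when its statistics prove the target cannot be in it.
        if target < stripe.minimum or target > stripe.maximum:
            continue
        stripes_read += 1
        matches.extend(v for v in stripe.values if v == target)
    return matches, stripes_read


# Synthetic, sorted data: 100 values in 10 stripes of 10 values each.
stripes = build_stripes(list(range(100)), stripe_size=10)
matches, stripes_read = scan_equal(stripes, 42)
print(matches, stripes_read, len(stripes))
```

With sorted data the statistics rule out nine of the ten stripes. On unsorted data the min/max ranges overlap and fewer stripes can be skipped, which is where an additional per-stripe Bloom filter helps for equality predicates.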