This is an old revision of the document!


Table of Contents

File types

Parquet

  • Better Column selecting
  • Columnar format
  • Binary format
  • Encoded & Compressed

ORC

  • Nested Data
  • Columnar format
  • Predicate pushdown (Min max + bloomfilters)
  • ACID support / cannot add
  • suggested to streaming (source)

Avro

kb/bigdata/file_types.1542810191.txt.gz · Last modified: (external edit)
Back to top
Driven by DokuWiki Recent changes RSS feed Valid CSS Valid XHTML 1.0