Differences

This shows you the differences between two versions of the page.

kb:bigdata:file_types [2018/11/21 14:52] – [Parquet] yehuda
kb:bigdata:file_types [2022/01/03 16:03] (current) – external edit 127.0.0.1
Line 2: Line 2:
  
 [[https://www.youtube.com/watch?v=_0Wpwj_gvzg|Spark + Parquet In Depth]] [[https://www.youtube.com/watch?v=_0Wpwj_gvzg|Spark + Parquet In Depth]]
 +[[https://www.youtube.com/watch?v=2vOfh064uUM|File Format Benchmark Avro JSON ORC and Parquet]]
 +[[https://www.youtube.com/watch?v=aIcxFIyL6xo|Berlin buzzwords18: Owen O'Malley – Fast Access To Your Complex Data - Avro, JSON, ORC, and Parquet]]
  
 ===== Parquet ===== ===== Parquet =====
Line 10: Line 12:
   * Binary format   * Binary format
   * Encoded & Compressed   * Encoded & Compressed
-  * +  * Support schema evolution - Format supports
 Limitation: Limitation:
   * Pushdown filters don't work on String / Binary ([[https://www.youtube.com/watch?v=_0Wpwj_gvzg|source]])   * Pushdown filters don't work on String / Binary ([[https://www.youtube.com/watch?v=_0Wpwj_gvzg|source]])
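
The "Encoded & Compressed" bullet in the Parquet list above names two separate steps: values are first encoded per column, and the encoded bytes are then passed through a general-purpose compressor. The sketch below is a toy illustration of that idea using only the Python standard library. It is not Parquet's on-disk layout; the sample column, the helper names, and the use of JSON and zlib are all assumptions made for the example.

```python
import json
import zlib


def dictionary_encode(values):
    """Replace each value with a small integer index into a list of distinct values."""
    dictionary = []
    index_of = {}
    indices = []
    for value in values:
        if value not in index_of:
            index_of[value] = len(dictionary)
            dictionary.append(value)
        indices.append(index_of[value])
    return dictionary, indices


def run_length_encode(indices):
    """Collapse consecutive repeats into [index, run_length] pairs."""
    runs = []
    for index in indices:
        if runs and runs[-1][0] == index:
            runs[-1][1] += 1
        else:
            runs.append([index, 1])
    return runs


def decode(dictionary, runs):
    """Invert both encoding steps to recover the original column."""
    return [dictionary[index] for index, count in runs for _ in range(count)]


# Hypothetical low-cardinality column, sorted so that equal values are adjacent.
column = ["DE"] * 500 + ["IL"] * 300 + ["US"] * 200

# Step 1: encode (dictionary, then run-length).
dictionary, indices = dictionary_encode(column)
runs = run_length_encode(indices)

# Step 2: compress the encoded bytes with a general-purpose codec.
raw_bytes = json.dumps(column).encode("utf-8")
encoded_bytes = json.dumps({"dict": dictionary, "runs": runs}).encode("utf-8")
compressed_bytes = zlib.compress(encoded_bytes)

# The round trip must be lossless.
assert decode(dictionary, runs) == column
print(len(raw_bytes), len(encoded_bytes), len(compressed_bytes))
```

The encoding step does most of the work here because the column has few distinct values and long runs. A high-cardinality or unsorted column would benefit much less from this particular pair of encodings.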
Line 21: Line 23:
     * Write mode append, that added embedded schema     * Write mode append, that added embedded schema
  
-==== vs ORC ====+=== vs ORC ===
   * indexed   * indexed
   * doesn't handle nested data   * doesn't handle nested data
kb/bigdata/file_types.1542811925.txt.gz · Last modified: (external edit)