Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Next revision
Previous revision
kb:bigdata:spark:import_json [2017/11/26 16:32] – created yehudakb:bigdata:spark:import_json [2022/01/03 16:03] (current) – external edit 127.0.0.1
Line 6: Line 6:
  
 # import json # import json
-df = sc.wholeTextFiles('/user/yehudakorotkin/development/raw_data/mixpanel/*.json').flatMap(lambda x: json.loads(x[1])).toDF()+df = sc.wholeTextFiles('/user/yehuda/development/raw_data/*.json').flatMap(lambda x: json.loads(x[1])).toDF()
  
  
  
  
-jsonRDD = sc.wholeTextFiles("/user/yehudakorotkin/development/raw_data/mixpanel/mixpanel-*.json").map(lambda x: json.loads(x[1]))+jsonRDD = sc.wholeTextFiles("/user/yehuda/development/raw_data/file-*.json").map(lambda x: json.loads(x[1]))
 namesJson = sqlContext.read.json(jsonRDD) namesJson = sqlContext.read.json(jsonRDD)
 namesJson.printSchema namesJson.printSchema
kb/bigdata/spark/import_json.1511713941.txt.gz · Last modified: (external edit)
Back to top
Driven by DokuWiki Recent changes RSS feed Valid CSS Valid XHTML 1.0