Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revisionPrevious revision
Next revision
Previous revision
learn:bigdata:hdfs [2014/08/07 13:01] yehudalearn:bigdata:hdfs [2022/01/03 16:03] (current) – external edit 127.0.0.1
Line 3: Line 3:
   * HDFS knows to handle large file (Ex. 1ptb) and split it to [[Data block|Data blocks]] and distrebute it over [[Cluster]].   * HDFS knows to handle large file (Ex. 1ptb) and split it to [[Data block|Data blocks]] and distrebute it over [[Cluster]].
   * HDFS has [[wp>Fault tolerance]] - [[HDFS]] replicate same [[Data block]] on X [[Data node|Data nodes]]. X is  [[Replication factor]]   * HDFS has [[wp>Fault tolerance]] - [[HDFS]] replicate same [[Data block]] on X [[Data node|Data nodes]]. X is  [[Replication factor]]
 +  * HDFS Master / Slaves architecture
 +
 +===== HDFS Features =====
 +  * Rack awarness - What happends if entire rack is lost ?
 +  * Reliable storage
 +  * High throughput
 +===== Tools =====
 +  * [[dfsadmin]]
 +  * [[fs shell]]
 +  * Web-UIs
 +
 +===== Nodes =====
 +==== Data world ====
 +  * [[Name node]] - FS Ops, Block Mapping
 +  * [[Secondary name node]] - Checkpoint Ops
 +  * [[Data node]] - Block Ops, Replications
 +==== MapReduce world ====
 +See: [[MapReduce]]
 +  * Master: [[Job tracker]] - controller of [[Task tracker|Task trackers]]
 +  * [[Task tracker]] 
learn/bigdata/hdfs.1407416466.txt.gz · Last modified: (external edit)
Back to top
Driven by DokuWiki Recent changes RSS feed Valid CSS Valid XHTML 1.0