Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revisionPrevious revision
Next revision
Previous revision
learn:bigdata:hdfs [2014/08/07 16:28] yehudalearn:bigdata:hdfs [2022/01/03 16:03] (current) – external edit 127.0.0.1
Line 3: Line 3:
   * HDFS knows to handle large file (Ex. 1ptb) and split it to [[Data block|Data blocks]] and distrebute it over [[Cluster]].   * HDFS knows to handle large file (Ex. 1ptb) and split it to [[Data block|Data blocks]] and distrebute it over [[Cluster]].
   * HDFS has [[wp>Fault tolerance]] - [[HDFS]] replicate same [[Data block]] on X [[Data node|Data nodes]]. X is  [[Replication factor]]   * HDFS has [[wp>Fault tolerance]] - [[HDFS]] replicate same [[Data block]] on X [[Data node|Data nodes]]. X is  [[Replication factor]]
 +  * HDFS Master / Slaves architecture
 +
 +===== HDFS Features =====
 +  * Rack awarness - What happends if entire rack is lost ?
 +  * Reliable storage
 +  * High throughput
 ===== Tools ===== ===== Tools =====
   * [[dfsadmin]]   * [[dfsadmin]]
Line 9: Line 15:
  
 ===== Nodes ===== ===== Nodes =====
-  * [[Name node]] +==== Data world ==== 
-  * [[Secundery name node]] +  * [[Name node]] - FS Ops, Block Mapping 
-  * [[Data node]]+  * [[Secondary name node]] - Checkpoint Ops 
 +  * [[Data node]] - Block Ops, Replications 
 +==== MapReduce world ==== 
 +See: [[MapReduce]] 
 +  * Master: [[Job tracker]] - controller of [[Task tracker|Task trackers]] 
 +  * [[Task tracker]] 
learn/bigdata/hdfs.1407428922.txt.gz · Last modified: (external edit)
Back to top
Driven by DokuWiki Recent changes RSS feed Valid CSS Valid XHTML 1.0