Differences
This shows you the differences between two versions of the page.
Both sides previous revisionPrevious revisionNext revision | Previous revision | ||
learn:bigdata:hdfs [2014/08/07 16:28] – yehuda | learn:bigdata:hdfs [2022/01/03 16:03] (current) – external edit 127.0.0.1 | ||
---|---|---|---|
Line 3: | Line 3: | ||
* HDFS knows to handle large file (Ex. 1ptb) and split it to [[Data block|Data blocks]] and distrebute it over [[Cluster]]. | * HDFS knows to handle large file (Ex. 1ptb) and split it to [[Data block|Data blocks]] and distrebute it over [[Cluster]]. | ||
* HDFS has [[wp> | * HDFS has [[wp> | ||
+ | * HDFS Master / Slaves architecture | ||
+ | |||
+ | ===== HDFS Features ===== | ||
+ | * Rack awarness - What happends if entire rack is lost ? | ||
+ | * Reliable storage | ||
+ | * High throughput | ||
===== Tools ===== | ===== Tools ===== | ||
* [[dfsadmin]] | * [[dfsadmin]] | ||
Line 9: | Line 15: | ||
===== Nodes ===== | ===== Nodes ===== | ||
- | | + | ==== Data world ==== |
- | * [[Secundery | + | |
- | * [[Data node]] | + | * [[Secondary |
+ | * [[Data node]] - Block Ops, Replications | ||
+ | ==== MapReduce world ==== | ||
+ | See: [[MapReduce]] | ||
+ | * Master: [[Job tracker]] - controller of [[Task tracker|Task trackers]] | ||
+ | * [[Task tracker]] |