Differences
This shows you the differences between two versions of the page.
Next revision | Previous revision | ||
learn:bigdata:hdfs [2014/08/07 12:35] – created yehuda | learn:bigdata:hdfs [2022/01/03 16:03] (current) – external edit 127.0.0.1 | ||
---|---|---|---|
Line 1: | Line 1: | ||
====== HDFS (HaDoop File System) ====== | ====== HDFS (HaDoop File System) ====== | ||
HDFS is one of utils that included at [[Hadoop]] framework. | HDFS is one of utils that included at [[Hadoop]] framework. | ||
- | * HDFS knows to handle large file (Ex. 1ptb) and split it to [[Data | + | * HDFS knows to handle large file (Ex. 1ptb) and split it to [[Data |
- | * HDFS has [[Fault-tolerance]] - [[HDFS]] replicate same [[Data | + | * HDFS has [[wp>Fault tolerance]] - [[HDFS]] replicate same [[Data |
+ | * HDFS Master / Slaves architecture | ||
+ | |||
+ | ===== HDFS Features ===== | ||
+ | * Rack awarness - What happends if entire rack is lost ? | ||
+ | * Reliable storage | ||
+ | * High throughput | ||
+ | ===== Tools ===== | ||
+ | * [[dfsadmin]] | ||
+ | * [[fs shell]] | ||
+ | * Web-UIs | ||
+ | |||
+ | ===== Nodes ===== | ||
+ | ==== Data world ==== | ||
+ | * [[Name node]] - FS Ops, Block Mapping | ||
+ | * [[Secondary name node]] - Checkpoint Ops | ||
+ | * [[Data node]] - Block Ops, Replications | ||
+ | ==== MapReduce world ==== | ||
+ | See: [[MapReduce]] | ||
+ | * Master: [[Job tracker]] - controller of [[Task tracker|Task trackers]] | ||
+ | * [[Task tracker]] |