Differences
This shows you the differences between two versions of the page.
| Next revision | Previous revision | ||
| learn:bigdata:hdfs [2014/08/07 12:35] – created yehuda | learn:bigdata:hdfs [2022/01/03 16:03] (current) – external edit 127.0.0.1 | ||
|---|---|---|---|
| Line 1: | Line 1: | ||
| ====== HDFS (HaDoop File System) ====== | ====== HDFS (HaDoop File System) ====== | ||
| HDFS is one of utils that included at [[Hadoop]] framework. | HDFS is one of utils that included at [[Hadoop]] framework. | ||
| - | * HDFS knows to handle large file (Ex. 1ptb) and split it to [[Data | + | * HDFS knows to handle large file (Ex. 1ptb) and split it to [[Data |
| - | * HDFS has [[Fault-tolerance]] - [[HDFS]] replicate same [[Data | + | * HDFS has [[wp>Fault tolerance]] - [[HDFS]] replicate same [[Data |
| + | * HDFS Master / Slaves architecture | ||
| + | |||
| + | ===== HDFS Features ===== | ||
| + | * Rack awarness - What happends if entire rack is lost ? | ||
| + | * Reliable storage | ||
| + | * High throughput | ||
| + | ===== Tools ===== | ||
| + | * [[dfsadmin]] | ||
| + | * [[fs shell]] | ||
| + | * Web-UIs | ||
| + | |||
| + | ===== Nodes ===== | ||
| + | ==== Data world ==== | ||
| + | * [[Name node]] - FS Ops, Block Mapping | ||
| + | * [[Secondary name node]] - Checkpoint Ops | ||
| + | * [[Data node]] - Block Ops, Replications | ||
| + | ==== MapReduce world ==== | ||
| + | See: [[MapReduce]] | ||
| + | * Master: [[Job tracker]] - controller of [[Task tracker|Task trackers]] | ||
| + | * [[Task tracker]] | ||