This is an old revision of the document!
Table of Contents
Name node
MOST IMPORATANT FOR CLUSTER!
- in-memory location of every Data block in cluster / rack
- Controlls DataNodes
- FS Ops
- Block Mapping
Requriements
- Should have LOT OF RAM, LOT OF MEM, LOT OF POWER
Contains
- Map - Snapshot of entire file system (in Memory)
- Tracks all the files
- Tracks Replication value - can be configured by Replication factor
- Edit log - Track any client changes of information
To keep presistnece of the data on hard drive the Edit log marges with Map