Table of Contents
Hadoop is framework that has set of tools to distrebute and proccess data over clasters.
Single node Cluser
- Standalon mode - all hadoop components run under single JVM
- Pesodo Destributed - each deamon runs under seperated JVM
- Fully Destributed - each deamon runs under seperated maching
Hadoop Technology stack
see more at http://incubator.apache.org/
- New-York Times - Want to convert 4 TB of articales to PDF. thay did it with AWS less then 24 hours and it cost them about $240!