Welcome
Posted January 30, 2007 byCategories: Uncategorized
Hadoop is a framework for running applications on large clusters of commodity hardware. Hadoop implements a computational paradigm named map/reduce, where the application is divided into many small fragments of work, each of which may be executed or reexecuted on any node in the cluster. In addition, it provides a distributed file system that stores data on the compute nodes, providing very high aggregate bandwidth across the cluster. Both map/reduce and the distributed file system are designed so that node failures are automatically handled by the framework.
The contributors on this blog actively contribute to Hadoop. We will be writing here about cool new features of Hadoop, our thoughts about where Hadoop is heading, and about our users’ experiences with Hadoop.