Pro Hadoop

You've heard the hype about Hadoop: it runs petabyte–scale data mining tasks insanely fast, it runs gigantic tasks on clouds for absurdly cheap, it's been heavily committed to by tech giants like IBM, Yahoo!, and the Apache Project, and it's completely open-source (thus free). But wha...

Full description

Bibliographic Details
Main Author: Venner, Jason (Author)
Corporate Author: SpringerLink (Online service)
Format: Electronic eBook
Language:English
Published: Berkeley, CA : Apress, 2009.
Subjects:
Online Access:Full Text via HEAL-Link
Table of Contents:
  • Getting Started with Hadoop Core
  • The Basics of a MapReduce Job
  • The Basics of Multimachine Clusters
  • HDFS details for Multimachine Clusters
  • MapReduce Details for Multimachine Clusters
  • Tuning Your MapReduce Jobs
  • Unit Testing and Debugging
  • Advanced and Alternate MapReduce Techniques
  • Solving Problems with Hadoop
  • Projects Based On Hadoop and Future Directions.