Organizations large and small are adopting Apache Hadoop to deal with huge application datasets. Hadoop: The Definitive Guide provides you with the key for unlocking the wealth this data holds. Hadoop is ideal for storing and processing massive amounts of data, but until now, information on this open-source project has been lacking, especially with regard to best practices. This comprehensive resource demonstrates how to use Hadoop to build reliable, scalable, distributed systems. Programmers will find details for analyzing large datasets with Hadoop, and administrators will learn how to set up and run Hadoop clusters.
With case studies that illustrate how Hadoop solves specific problems, this book helps you:
* Learn the Hadoop Distributed File System (HDFS), including ways to use its many APIs to transfer data
* Write distributed computations with MapReduce, Hadoop's most vital component
* Become familiar with Hadoop's data and IO building blocks for compression, data integrity, serialization, and persistence
* Learn the common pitfalls and advanced features for writing real-world MapReduce programs
* Design, build, and administer a dedicated Hadoop cluster
* Use HBase, Hadoop's database for structured and semi-structured data
And more. Hadoop: The Definitive Guide is still in progress, but you can get started on this technology with the Rough Cuts edition, which lets you read the book online or download it in PDF format as the manuscript evolves.
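《梁冬说庄子·应帝王》 (Liang Dong on Zhuangzi: "Responding to Emperors and Kings") synopsis: Many people believe that 《应帝王》 is about how an emperor should govern the world. In fact, the lesson 《应帝王》 ultimately teaches is that if I
《十岁前,父母给孩子的礼物》 (Gifts Parents Give Their Children Before Age Ten) synopsis: This book urges parents to seize the golden first ten years of a child's growth and give the child gifts that last a lifetime: language and communication skills and
《要怎么收获,先那么栽》 (As You Would Reap, So First You Must Sow) synopsis: Define your own life through your own effort, and do not let your future self resent who you are now; a youth without struggle is not worth mentioning; hold on to your dreams
《MATLAB计算机视觉实战》 (MATLAB Computer Vision in Practice) synopsis: Using the Chinese-localized edition of MATLAB 8.X as its tool, this book gives an accessible introduction based on the Computer Vision System Toolbox (ComputerVisionSys
As an introductory text on jQuery Mobile, Brad Broulik's 《jQueryMobile快速入门》 (jQuery Mobile Quick Start) uses examples to explain the basics of jQuery Mobile and the core...
This book presents the sincere reflections of Japanese graphic designer Yuji Koiso (小矶裕司) on design philosophy and design execution, drawn from the design projects he has taken part in. The book is divided into three major parts, which respectively
《写给架构师的Linux实践》 (Linux Practice for Architects) synopsis: The book first outlines design methods for Linux projects, then explains the core ideas to focus on when designing such projects, and, when using
《中国圣书:悦读《论语》》 (China's Sacred Book: Enjoying the Analects) synopsis: The Analects (《论语》) is a collection of sayings that records the words and deeds of Confucius and his disciples, and it is the most important classic of the Confucian school. It was compiled around the Warring States
Software Testing (2nd edition of the original), ISBN: 9787111185260, by Ron Patton (US), translated by Zhang Xiaosong (张小松) et al. About the author: Ron Patton has
《jQuery高级编程》 (Advanced jQuery Programming) provides a comprehensive introduction to jQuery from a developer's perspective, and also covers many of jQuery's advanced features in depth. In 《jQuery高级编程
《Excel应用大全》 (The Complete Guide to Excel Applications). Book highlights: 《Excel应用大全》 suits Excel users at every level. It can serve as a getting-started guide for beginners and as a reference manual for intermediate and advanced users. The book's many examples also...
This book is the first "why-oriented" book (Why-Oriented Book) in China to give a comprehensive, systematic introduction to Flash ActionScript 3. The book is divided into five parts. Part one: A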
The mobile communications market remains the fastest growing segment of the global computing...
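《主板维修技能实训(芯片级)》 (Motherboard Repair Skills Training, Chip Level). Synopsis: With extensive illustrations and examples, this book explains step by step the structure and circuit composition of motherboards, common repair tools, methods for judging whether components are good or faulty, bus slots and test points, as well as...
[Synopsis] This book gives an accessible introduction to Redis's five data types and demonstrates how Redis is used through several practical examples. In addition, the book covers Redis's
《百马人生,跑向东京》 (A Hundred-Marathon Life, Running Toward Tokyo) synopsis: From completing his first marathon in 2010 to completing his 100th marathon in December 2017, Tian Tongsheng (田同生) turned his dream of a "hundred-marathon life" into reality.
《断病如断案:中医如何看病》 (Diagnosing Illness Like Solving a Case: How Traditional Chinese Medicine Treats Patients) synopsis: This book is a compilation of traditional Chinese medicine case records, written by TCM experts on the basis of many years of clinical diagnosis and treatment experience together with a large body of TCM literature. The whole
《区块链技术进阶指南》 (An Advanced Guide to Blockchain Technology) synopsis: This book gives a systematic introduction covering the brief history of blockchain, ledger models, networking, consensus, contract engines, and applications, hoping to help
《无师自通8:铅笔素描头像超精解析(修订版)》 (Self-Taught 8: Detailed Analysis of Pencil Sketch Portraits, Revised Edition) synopsis: Sketching is the foundation of all the plastic arts and has a distinctive expressive appeal; learning to sketch is the way into the palace of art
JavaScript is a scripting language that is widely used in web application development. This book guides readers to study JavaScript in depth and become JavaScript experts. The whole book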