Apache Hadoop is ideal for organizations with a growing need to store and process massive application datasets. Hadoop: The Definitive Guide is a comprehensive resource for using Hadoop to build reliable, scalable, distributed systems. Programmers will find details for analyzing large datasets with Hadoop, and administrators will learn how to set up and run Hadoop clusters. The book includes case studies that illustrate how Hadoop solves specific problems.
Organizations large and small are adopting Apache Hadoop to deal with huge application datasets. Hadoop: The Definitive Guide provides you with the key for unlocking the wealth this data holds. Hadoop is ideal for storing and processing massive amounts of data, but until now, information on this open-source project has been lacking -- especially with regard to best practices. This comprehensive resource demonstrates how to use Hadoop to build reliable, scalable, distributed systems. Programmers will find details for analyzing large datasets with Hadoop, and administrators will learn how to set up and run Hadoop clusters.
With case studies that illustrate how Hadoop solves specific problems, this book helps you:
* Learn the Hadoop Distributed File System (HDFS), including ways to use its many APIs to transfer data
* Write distributed computations with MapReduce, Hadoop's most vital component
* Become familiar with Hadoop's data and IO building blocks for compression, data integrity, serialization, and persistence
* Learn the common pitfalls and advanced features for writing real-world MapReduce programs
* Design, build, and administer a dedicated Hadoop cluster
* Use HBase, Hadoop's database for structured and semi-structured data
And more. Hadoop: The Definitive Guide is still in progress, but you can get started on this technology with the Rough Cuts edition, which lets you read the book online or download it in PDF format as the manuscript evolves.
《操作系统概念》(第6版翻译版)是讨论了操作系统中的基本概念和算法,并对大量实例(如Linux系统)进行了研究。全书内容共分七部分
《成器之道:史前至宋的陶瓷造型艺术》内容简介:本书从艺术史的角度对史前至秦汉、隋唐、两宋这几个时期中国陶瓷的器形和艺术风格
《区域现代化基本理论研究》内容简介:本书概述了区域现代化探索的一些基本理论问题,包括政治区域现代化、经济区域现代化、文化区
《学校是比家大一点的地方(全2册)》内容简介:在北京,有这样一所很“土”的学校,叫一土学校。创办之初,这所学校只有三间教室,
未来十年,将是中国农产品商业品牌崛起的“黄金十年”。中国,能不能出现“下一个褚橙”?中国,能不能出现与佳沛(Zespri)、都
《机器学习算法(原书第2版)》内容简介:本书介绍了数据科学领域常用的所有重要机器学习算法以及TensorFlow和特征工程等相关内容。
HTML5、CSS3和JavaScript技术是网页设计的精髓,《精通HTML5+CSS3+JavaScript网页设计》以应用实例和综合实战案例的形式逐一详解
Learnhowtocreateresponsive,data-drivenwebsiteswithPHP,MySQL,andJavaScript-whethe...
计算机GBK汉字输入法速查字典 目录 凡例汉语拼音编码索引部首索引(一)部首目录(二)部首索引表(三)难检字笔画索引表四角号码索引字典正文附录四角号码查字法汉字...
《工匠革命:制造业的精神与文化变迁》内容简介:本书从人类制造业的历史演化及其文化变迁出发,对工匠精神的起源、形成与发展进行
Ruby for Rails-(中文版) 本书特色 本书是一部专门为Rails实践而写的经典Ruby著作,由四部分组成,共17章。**部分讲述Ruby和Rail...
《实用软件架构》内容简介:本书由IBM杰出工程师、首席技术官Tilak Mitra亲笔撰写,Amazon全五星评价。全书通过一整套实用的案例研
《任正非与华为神话》内容简介:华为作为中国最伟大的企业,成立于1987年,目前拥有超过18万名员工,业务遍及170多个国家和地区,年
数据库系统原理及应用 本书特色 本书特色:根据数据库发展的过程与特点,从不同角度出发,凝炼出数据库发展的三条线索覆盖的知识面广,既包括数据库理论,又包括数据库应...
JackStoufferJackStoufferisaprogrammerwhohasseveralyearsofexperienceindesigningwe...
《经济转型背景下的财富管理与资产配置》内容简介:当前,国际国内经济金融形势复杂多变,投资单一市场、单一资产的不确定性不断加
《Web前端自动化构建》内容简介:本书非常适合前端构建的初学者入门,所介绍的Gulp、Bower、Yeoman都是业内流行且易于上手的工具。
《中国经学史十讲》内容简介:“经”原先只是指代一种纺织工艺,在漫漫历史长河中,其逐渐变成了唯指孔子亲授的儒家五经的专称。朱
《AddingAjax中文版》讲述了如何在现有的Web应用程序中添加Ajax,为传统的Web应用程序带来更好的交互性,从而为应用程序附加更大
Namedoneofthegreatestmindsofthe20thcenturybyTime,TimBerners-Leeisresponsibleforo...