Apache Hadoop is ideal for organizations with a growing need to store and process massive application datasets. Hadoop: The Definitive Guide is a comprehensive resource for using Hadoop to build reliable, scalable, distributed systems. Programmers will find details for analyzing large datasets with Hadoop, and administrators will learn how to set up and run Hadoop clusters. The book includes case studies that illustrate how Hadoop solves specific problems.
Organizations large and small are adopting Apache Hadoop to deal with huge application datasets. Hadoop: The Definitive Guide provides you with the key for unlocking the wealth this data holds. Hadoop is ideal for storing and processing massive amounts of data, but until now, information on this open-source project has been lacking, especially with regard to best practices.
With case studies that illustrate how Hadoop solves specific problems, this book helps you:
* Learn the Hadoop Distributed File System (HDFS), including ways to use its many APIs to transfer data
* Write distributed computations with MapReduce, Hadoop's most vital component
* Become familiar with Hadoop's data and IO building blocks for compression, data integrity, serialization, and persistence
* Learn the common pitfalls and advanced features for writing real-world MapReduce programs
* Design, build, and administer a dedicated Hadoop cluster
* Use HBase, Hadoop's database for structured and semi-structured data
And more. Hadoop: The Definitive Guide is still in progress, but you can get started on this technology with the Rough Cuts edition, which lets you read the book online or download it in PDF format as the manuscript evolves.