Apache Hadoop is ideal for organizations with a growing need to store and process massive application datasets. Hadoop: The Definitive Guide is a comprehensive resource for using Hadoop to build reliable, scalable, distributed systems. Programmers will find details for analyzing large datasets with Hadoop, and administrators will learn how to set up and run Hadoop clusters. The book includes case studies that illustrate how Hadoop solves specific problems.
Organizations large and small are adopting Apache Hadoop to deal with huge application datasets. Hadoop: The Definitive Guide provides you with the key for unlocking the wealth this data holds. Hadoop is ideal for storing and processing massive amounts of data, but until now, information on this open-source project has been lacking -- especially with regard to best practices. This comprehensive resource demonstrates how to use Hadoop to build reliable, scalable, distributed systems. Programmers will find details for analyzing large datasets with Hadoop, and administrators will learn how to set up and run Hadoop clusters.
With case studies that illustrate how Hadoop solves specific problems, this book helps you:
* Learn the Hadoop Distributed File System (HDFS), including ways to use its many APIs to transfer data
* Write distributed computations with MapReduce, Hadoop's most vital component
* Become familiar with Hadoop's data and IO building blocks for compression, data integrity, serialization, and persistence
* Learn the common pitfalls and advanced features for writing real-world MapReduce programs
* Design, build, and administer a dedicated Hadoop cluster
* Use HBase, Hadoop's database for structured and semi-structured data
And more. Hadoop: The Definitive Guide is still in progress, but you can get started on this technology with the Rough Cuts edition, which lets you read the book online or download it in PDF format as the manuscript evolves.
《新文科背景下的外语教学研究》内容简介:本书共收集34篇论文,涵盖语言学、文学、外语教学等各个学科领域,结合新文科建设,围绕
叶夫根尼·莫罗佐夫(EvgenyMorozov),科技互联网批评家,《新共和》杂志编辑,《纽约时报》、《金融时报》、《华尔街日报》、《
《PyTorch机器学习从入门到实战》内容简介:近年来,基于深度学习的人工智能掀起了一股学习的热潮。本书是使用PyTorch深度学习框架
奈良美智出生於1959年12月5日,日本青森縣弘前市人。是日本現代美術界極具影響力的畫家。1981~1988年在愛知縣立藝術大學和研究所
本书从移动通信的基本知识入手,对TD-SCDMA无线系统的原理和实现做了详细讲解,并重点阐述了RNC和NodeB的总体设计和功能实现,使
《中国客家对联大典(第三卷)(精)》内容简介:本书收录的对联是全世界历代客家人或含有客家元素的对联作品。这里包括全世界客家
《分布式系统概念与设计》旨在全面介绍因特网及其他常用分布式系统的原理、体系结构、算法和设计,内容涵盖分布式系统的相关概念
Searchisnotjustaboxandtenbluelinks.Searchisajourney:anexplorationwherewhatweenco...
The#1TelecomGuideforBusinesspeopleandNontechnicalProfessionals—FullyUpdatedforCl...
《JSF第一步:JSF+Spring+Hibernate+AJAX编程》讲述JSF是表示层框架的标准,Hibernate是一个比较完善的对象关系映射工具,Spr...
《实用社交礼仪》内容简介:礼仪是一首古老而年轻的诗,“飘散着舒人而温馨的国风”;礼仪是一曲涓涓的高山流水,吟唱着中华五千年
《大道至简》内容简介:本书提出了审视软件工程的全新视角和软件工程的体系模型(EHM,软件工程层状模型)。本书用非工程的方式重新解
《爱上古诗文》内容简介:一年一度的上海小学生古诗文大会暨古诗文“桂冠少年”选拔活动即将在9月份启动,承办方上海教育报刊总社《
《小红书达人实操攻略》内容简介:小红书以其操作简单、界面简约、阅读轻松的特点吸引了不少年轻人,是当下流行的分享和发现世界精
为了彻底理解是什么使得Linux能正常运行以及其为何能在各种不同的系统中运行良好,你需要深入研究内核最本质的部分。内核处理CPU
此书对中国网络媒体的第一个十年这一重要的历史阶段首次进行了全景式、全程式的历史记录,并进行了全面深入的研究,在一定程度上
一个会点石成金的神仙分别问三个人想要什么。第一个人说,我要很多很多的金子,然后神仙用手指往他面前的石头一点,石头就变成了
Everystageinthedesignofanewwebsiteisanopportunitytomeetormissdeadlinesandbudgeta...
《系统安装、维护及故障排除实战》内容简介:本书由资深计算机硬件工程师精心编写,讲解了安装操作系统前的准备、分区与格式化硬盘
photoshop cs5入门与提高 本书特色 本书从实用的角度出发,全面、系统地讲解了photoshopcs5的所有应用功能,基本涵盖了photoshopcs...