Apache Hadoop is ideal for organizations with a growing need to store and process massive application datasets. Hadoop: The Definitive Guide is a comprehensive resource for using Hadoop to build reliable, scalable, distributed systems. Programmers will find details for analyzing large datasets with Hadoop, and administrators will learn how to set up and run Hadoop clusters. The book includes case studies that illustrate how Hadoop solves specific problems.
Organizations large and small are adopting Apache Hadoop to deal with huge application datasets. Hadoop: The Definitive Guide provides you with the key for unlocking the wealth this data holds. Hadoop is ideal for storing and processing massive amounts of data, but until now, information on this open-source project has been lacking -- especially with regard to best practices. This comprehensive resource demonstrates how to use Hadoop to build reliable, scalable, distributed systems. Programmers will find details for analyzing large datasets with Hadoop, and administrators will learn how to set up and run Hadoop clusters.
With case studies that illustrate how Hadoop solves specific problems, this book helps you:
* Learn the Hadoop Distributed File System (HDFS), including ways to use its many APIs to transfer data
* Write distributed computations with MapReduce, Hadoop's most vital component
* Become familiar with Hadoop's data and IO building blocks for compression, data integrity, serialization, and persistence
* Learn the common pitfalls and advanced features for writing real-world MapReduce programs
* Design, build, and administer a dedicated Hadoop cluster
* Use HBase, Hadoop's database for structured and semi-structured data
And more. Hadoop: The Definitive Guide is still in progress, but you can get started on this technology with the Rough Cuts edition, which lets you read the book online or download it in PDF format as the manuscript evolves.
Photoshop CC自学魔法书-(附光盘) 本书特色 《Photoshop CC自学魔法书》为Photoshop初学者量身打造,是入门级读者快速、全面掌握P...
微机原理与接口技术实用教程 目录 第1章 微型计算机概述.1.1 计算机的发展概况 1.1.1 计算机的发展历程1.1.2 微型计算机的发展历程1.2 微处理器...
《可穿戴医疗——移动医疗新浪潮》内容简介:可穿戴设备作为互联网下一阶段的智能载体,已经开始进入人们生活的方方面面,特别是在
《Minecraft我的世界》内容简介:越玩越聪明! Minecraft我的世界是一款高自由度的沙盒建造游戏,玩家可以在游戏中的三维空间里创造
《命运好好玩(汉、英双语版)》内容简介:蔡澜为人幽默风雅,以鲜活、生动的文字讲述他的所见所闻,与读者分享他的识见。他说:“
《京胡伴奏京剧经典唱段选》内容简介:这本《陈平一京胡伴奏京剧经典唱段》包含青衣、花脸、老旦、老生的唱腔名段,包括《西施》、
【本书目录】Introduction7WhyVintage?THeClothes14Greatvintagepieceswornbywomenlikeyou.E...
《PHP核心技术与最佳实践》是一本致力于为希望成为中高级PHP程序员的读者提供高效而有针对性指导的经典著作。系统归纳和深刻解读
Readytoexploretheglamourousworldofwirelesssensornetworking?Createdistributedsens...
《中原经济区竞争力报告(2017)》内容简介:本书围绕传统平原农区工业化与经济社会转型的这个主轴,就经济竞争力、社会保障建设、
本书对全新的移动服务(丰富的语音系统、因特网、信息和内容服务)的迅猛发展做出了全面描述。它介绍了全世界通信市场的发展史,
用AngularJS开发下一代Web应用 本书特色 我们都希望开发更小型、更轻量的Web应用,让创建应用更加容易,并且当项目变大时仍然易于测试、扩展和维护。这本...
ObjectOrientedProgrammingisaveryimportantaspectofmodernprogramminglanguages.Theb...
《西安史话》内容简介:本书只是对西安厚重历史的故事呈现,举重若轻;只是对西安3100多年建城史和1100多年建都史的粗线勾勒,挂一
《齐善鸿讲道德经》内容简介:如何轻松学到《道德经》的精髓? 如何打破艰难晦涩的语句直达道学根本? 如何在生活和工作中运用《道
笔记本电脑完全宝典 本书特色 本书采用环境教学法,版式新颖、美观实用,全程图解、快速上手,双色印刷、轻松阅读,书盘结合、互动教学。在内容的安排上,由浅入深、较有...
《数字货币——比特币数据报告与操作指南》是壹比特科技数字货币研究团队倾力编写的一本关于数字货币白皮书,书中详细阐述了包括
《百马人生,跑向东京》内容简介:从2010完成第一场马拉松,到2017年12月完成第100场马拉松,田同生“百马人生”的梦想变成了现实。
《李阳冰篆书三坟记》内容简介:《三坟记》由唐李季卿撰文,李阳冰书,为其篆书代表作。立于唐大历二年(767),碑文阴阳两面,二十
内容简介:作为服务器端的JavaScript解释器,Node是一个轻量高效的开发平台,用于构建响应快速、高度可扩展的Web应用。它使用事件