Organizations large and small are adopting Apache Hadoop to deal with huge application datasets. Hadoop: The Definitive Guide provides you with the key for unlocking the wealth this data holds. Hadoop is ideal for storing and processing massive amounts of data, but until now, information on this open-source project has been lacking, especially with regard to best practices. This comprehensive resource demonstrates how to use Hadoop to build reliable, scalable, distributed systems. Programmers will find details for analyzing large datasets with Hadoop, and administrators will learn how to set up and run Hadoop clusters.
With case studies that illustrate how Hadoop solves specific problems, this book helps you:
* Learn the Hadoop Distributed File System (HDFS), including ways to use its many APIs to transfer data
* Write distributed computations with MapReduce, Hadoop's most vital component
* Become familiar with Hadoop's data and I/O building blocks for compression, data integrity, serialization, and persistence
* Learn the common pitfalls and advanced features for writing real-world MapReduce programs
* Design, build, and administer a dedicated Hadoop cluster
* Use HBase, Hadoop's database for structured and semi-structured data
And more. Hadoop: The Definitive Guide is still in progress, but you can get started on this technology with the Rough Cuts edition, which lets you read the book online or download it in PDF format as the manuscript evolves.
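Social networks have completely transformed internet culture and changed the way the whole world socializes. Why did MySpace lose so badly to Facebook? And where does the future of social networking lie? A few years ago
让你的PPT会说话 (Make Your PPT Speak). Book features. Intended readers: 1. Office professionals who have a basic grasp of PPT operations and urgently want to improve. 2. Trainers and school teachers who need PPT to win over their audience. 3. Those about to enter the workforce who need to quickly...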
For one-semester courses in Introductory Biology for non-majors. Life on Earth, Fifth Editio...
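《企业科技财税实操指南》 (A Practical Guide to Corporate Technology Finance and Taxation). Synopsis: Building on an understanding of R&D and technology, and drawing on the processes and characteristics of corporate R&D activities, this book gives an overview of the fiscal
Rigid habits of thought are creativity's greatest killer. When we try to come up with ideas, we keep falling back on images drawn from past experience and fail to reach original ones. But how can we distinguish the wandering
An in-depth look at SOA and Web services, and a practical guide that introduces SOA comprehensively: simplify infrastructure and maximize agility. This is a book about using service-oriented architecture (SOA, Se
Jeffrey Zeldman is one of the world's best-known website designers. His personal site (www.zeldman.com) has been welcomed by 16 million visitors, and every day there are visitors from Web design
《3G的业务及管理》 (3G Services and Management) aims to analyze 3G services comprehensively and systematically. After introducing the overall architecture of the service network and the characteristics of 3G services, it focuses, for each type of service (including 3G-specific services), on the tech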
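Visual FoxPro程序设计教程 (Visual FoxPro Programming Tutorial). Book features: Built around the example of a "Yuelu Academy Library Management System", this book fully describes every stage of developing a database application system, weaving the concrete steps of system development in detail throughout...
《企业级Kubernetes应用》 (Enterprise Kubernetes Applications). Synopsis: Since releasing version 1.0 on July 21, 2015, Kubernetes has developed continuously for more than three years. As an open-source container application
《中国圣书:悦读《论语》》 (China's Sacred Book: Reading the Analects with Pleasure). Synopsis: The Analects is a collection of sayings recording the words and deeds of Confucius and his disciples. It is the most important classic of the Confucian school and was compiled around the Warring States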
Good user interface design isn't just about aesthetics or using the latest technology. Designe...
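《法治无禁区》 (No Forbidden Zones for the Rule of Law). Synopsis: This book collects the author's reflections and conclusions from frontline casework. It is closely tied to current judicial reform practice and aligned with cutting-edge judicial thinking. Staying true to the original aspiration, facing
Java Web从入门到精通 (Java Web from Beginner to Expert; CD included) (Software Development Video Lecture Hall series). Book features: The "Software Development Video Lecture Hall" series is one of the flagship lines of Tsinghua University Press's "Video Lecture Hall" collection. The collection comprises several sub-series, each...
《中国当代经典电影赏析》 (Appreciating Classic Contemporary Chinese Films). Synopsis: This book is an outcome of the Nanjing University international cooperation project "Cooperative Research on Teaching Chinese Culture in Belt and Road Countries", and is aimed mainly at intermediate and advanced Chinese
计算机图形学 (Computer Graphics). Synopsis: This book gives a comprehensive introduction to the system components of computer graphics, graphics generation and display algorithms, and techniques for implementing interaction. Its main topics include computer graphics systems, basic raster graphics generation techniques, graphics transformations, interactive...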
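Adapted from the previous five editions, this book systematically introduces the basic theory of modern communication systems and the latest technological developments. The book has eight chapters, covering: introduction; signals and spectra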
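从0起飞Office 2007公司办公易学通 (Taking Off from Zero: Office 2007 for Company Work Made Easy). Book features: Take off from zero with easy-to-learn computer office applications. Start from zero, modular teaching, well-matched examples, tests of what you have learned, live video, great value, beginner tips, answers to questions...
《全新Marc实例教程与常见问题解析》 (The All-New Marc Tutorial by Example, with Answers to Common Problems): most of the cases come from real engineering projects. They not only explain the specific operating steps but also include illustrations so that users can
《基于神经网络的智能诊断》 (Intelligent Diagnosis Based on Neural Networks) has eight parts. It covers the origins, development, current state, and trends of neural-network-based intelligent diagnosis; the concepts and strategies of intelligent diagnosis problems for complex systems; and, based on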