Apache Hadoop is ideal for organizations with a growing need to store and process massive application datasets. Hadoop: The Definitive Guide is a comprehensive resource for using Hadoop to build reliable, scalable, distributed systems. Programmers will find details for analyzing large datasets with Hadoop, and administrators will learn how to set up and run Hadoop clusters. The book includes case studies that illustrate how Hadoop solves specific problems.
Organizations large and small are adopting Apache Hadoop to deal with huge application datasets. Hadoop: The Definitive Guide provides you with the key for unlocking the wealth this data holds. Hadoop is ideal for storing and processing massive amounts of data, but until now, information on this open-source project has been lacking -- especially with regard to best practices. This comprehensive resource demonstrates how to use Hadoop to build reliable, scalable, distributed systems. Programmers will find details for analyzing large datasets with Hadoop, and administrators will learn how to set up and run Hadoop clusters.
With case studies that illustrate how Hadoop solves specific problems, this book helps you:
* Learn the Hadoop Distributed File System (HDFS), including ways to use its many APIs to transfer data
* Write distributed computations with MapReduce, Hadoop's most vital component
* Become familiar with Hadoop's data and IO building blocks for compression, data integrity, serialization, and persistence
* Learn the common pitfalls and advanced features for writing real-world MapReduce programs
* Design, build, and administer a dedicated Hadoop cluster
* Use HBase, Hadoop's database for structured and semi-structured data
And more. Hadoop: The Definitive Guide is still in progress, but you can get started on this technology with the Rough Cuts edition, which lets you read the book online or download it in PDF format as the manuscript evolves.
Word/Excel/PPT 2016从入门到精通 本书特色 ★本书《Word/Excel/PPT 2016从入门到精通》深入浅出,从基础入门知识到专业精通内容...
《ASP.NET4高级程序设计(第4版)》,本书是ASP.NET领域的鸿篇巨制,全面讲解了ASP.NET4的各种特性及其背后的工作原理,并给出了许
《阿里巴巴基本动作:管理者必须修炼的24个基本动作》内容简介:收齐日报很难,收到合格的日报更难,用日报管好团队难上加难?招不
CSS创意课:全球优秀交互页面设计 本书特色 《CSS创意课——全球优秀交互页面设计》由未来出版编著,王慧玲译,本书涵盖了一切你需要提高的CSS网页布局知识。跟...
《Simulink仿真及代码生成技术入门到精通》围绕Simulink软件的仿真和代码生成技术,从原理上展开阐述,把握整体,注重细节,让读
LearnJavaScriptandjQueryanicerwayThisfull-colorbookadoptsavisualapproachtoteachi...
《2020年法律硕士联考重要法条释解》内容简介:本书主要内容为法律硕士联考法条类图书,针对法条涵盖的考点,分析和讲解,包含5科,
《掘金:互联网+时代创业黄金指南》内容简介:“互联网+”这个词随着政府工作报告变得炙手可热,这个词既是对过去已经发生的总结,
《中国古代四大发明:源流、外传及世界影响》基于近30年间对考古发掘资料的利用、出土文物的考察和中外文献的考证,系统而深入地研
《大学生社会责任感培育的实践与探索》内容简介:本书围绕如何培育大学生的社会责任感,基于“全人教育”理念,即通过“社会学习”
《企业会计准则原文、应用指南案例详解(2023年版)》内容简介:企业会计准则是会计从业人员进行会计确认、会计计量、会计报告的基
《2024年MBA、MPA、MPAcc、MEM管理类联考综合能力逻辑历年真题分类精解》内容简介:本书针对逻辑题型,深入分析探究,用“举题型讲
《Hadoop技术内幕:深入解析MapReduce架构设计与实现原理》内容简介:“Hadoop技术内幕”共两册,分别从源代码的角度对“Common+H
《动漫美少年素描技法》内容简介:本书主要讲解了漫画美少年的绘制方法,其中包括漫画美少年的基本概念和分类、美少年头部的画法、
◎聯合推薦實踐大學設計學院院長/安郁茜政治大學科技管理研究所教授/李仁芳奧美廣告執行創意總監/胡湘雲設計,打造感動人心的
WhilethereareseveralbooksonprogrammingforMacOSX,AdvancedMacOSXProgramming:TheBig...
《创新思维与方法》内容简介:本书共12章,包括创新的基础知识、创新驱动发展、互联网+行动计划、大数据时代的思维变革、发明问题传
本书是数字通信领域的一本经典教材,通过对概率论及随机过程的复习,详细介绍了数字和模拟信源编码、数字调制信号和窄带信号与系
内容简介本书全面介绍了统计自然语言处理的基本概念、理论方法和最新研究进展,内容包括形式语言与自动机及其在自然语言处理中的
《Python学习手册(第3版)》讲述了:Python可移植、功能强大、易于使用,是编写独立应用程序和脚本应用程序的理想选择。无论你是刚