Apache Hadoop is ideal for organizations with a growing need to store and process massive application datasets. Hadoop: The Definitive Guide is a comprehensive resource for using Hadoop to build reliable, scalable, distributed systems. Programmers will find details for analyzing large datasets with Hadoop, and administrators will learn how to set up and run Hadoop clusters. The book includes case studies that illustrate how Hadoop solves specific problems.
Organizations large and small are adopting Apache Hadoop to deal with huge application datasets. Hadoop: The Definitive Guide provides you with the key for unlocking the wealth this data holds. Hadoop is ideal for storing and processing massive amounts of data, but until now, information on this open-source project has been lacking -- especially with regard to best practices. This comprehensive resource demonstrates how to use Hadoop to build reliable, scalable, distributed systems. Programmers will find details for analyzing large datasets with Hadoop, and administrators will learn how to set up and run Hadoop clusters.
With case studies that illustrate how Hadoop solves specific problems, this book helps you:
* Learn the Hadoop Distributed File System (HDFS), including ways to use its many APIs to transfer data
* Write distributed computations with MapReduce, Hadoop's most vital component
* Become familiar with Hadoop's data and IO building blocks for compression, data integrity, serialization, and persistence
* Learn the common pitfalls and advanced features for writing real-world MapReduce programs
* Design, build, and administer a dedicated Hadoop cluster
* Use HBase, Hadoop's database for structured and semi-structured data
And more. Hadoop: The Definitive Guide is still in progress, but you can get started on this technology with the Rough Cuts edition, which lets you read the book online or download it in PDF format as the manuscript evolves.
《高校辅导员工作案例精选》内容简介:本书是大学生思想政治工作案例集合,涉及学生思想政治教育、党团和班级建设、学业指导、日常
《AdobeDreamweaverCS5中文版经典教程》由Adobe公司的专家编写,是AdobeDreamweavelCS5软件的官方指定培训教材。全书共分为1...
ThislittlebookshowsJavaScriptdevelopershowtobuildsuperbwebapplicationswithCoffee...
《新能源和可再生能源发展与产业化研究》对新能源和可再生能源的含义和分类进行了理论界定,涉及类别主要包括太阳能、地热能、生
精通D3.js-交互式数据可视化高级编程 本书特色 本书以当前流行的数据可视化技术d3.js为主要内容,分为三大部分,共计13章。**部分讲述基础知识,第二部分...
《医学临床“三基”训练技能图解·医师分册(全新彩版)》内容简介:★全新内容:本书文字内容全部进行了重新编写,大幅度提高了入
《中国前沿:不如问问科学家吧》内容简介:本书精选了人类对未来生活无限畅想的“人机共生”、“太空探索”、“生物治疗”“粪菌移
《Photoshop CS5实战从入门到精通(超值版)》内容简介:《Photoshop CS5实战从入门到精通(超值版)》通过精选案例引导读者深入学
photoshop cs5入门与提高 本书特色 本书从实用的角度出发,全面、系统地讲解了photoshopcs5的所有应用功能,基本涵盖了photoshopcs...
《孙子兵法(插图本)》内容简介:本书是春秋末年孙武所著,为中国现存最古老最完备的军事学著作。《孙子兵法》自问世以来,对中国
《田小七来啦4:家庭里的小科学》内容简介:妈妈蔡小芹买了一筐鸡蛋,里面有好蛋也有坏蛋,田小七说他会魔法,一下子就能看出哪个是
《犹太人智慧书》内容简介:本书对《塔木德》《财箴》和《诺未门》中浩如烟海的智慧进行了归纳和总结,将其分为四个类别:经商智慧
《精通CSS+DIV网页样式与布局》从零开始,细致介绍CSS的语法规则,透彻讲解CSS应用于各种网页元素的步骤和技巧深入剖析,CSS+DIV
《中小银行运维架构》内容简介:本书为商业银行构建运维体系和掌握核心运维技术提供了指导。以一家中小型的商业银行为蓝本,讲述商
《代码整洁之道:程序员的职业素养》内容简介:本书是编程大师“Bob大叔”40余年编程生涯的心得体会的总结,讲解要成为真正专业的程
UI设计入门一本就够 本书特色 本书紧扣用户界面设计趋势,主要讲解了什么是UI设计,UI设计的原则与理念,UI的文字、图片和图标设计,网页UI设计,移动端UI设...
《医点就通》内容简介:我们在面对健康问题时,都有这样的困扰:自己和家人一生病就着急;一有病就往医院跑,费时费力;没有足够时
如果三十多年前艾斯林格没有遇见乔布斯,你我今天看到的iPhone和MacBook也许不是现在这个样子,当然,也许根本就没有这么酷的手机
TounderstandWebdesignitiscriticaltounderstanddesignfirstandtechnologysecond.What...
《当戈壁遇见长江》内容简介:戈壁挑战赛是中国企业家的练兵场,是对个人意志、体能素质、战略战术和团队协作等方面的综合考验。在