"Mining the Web: Discovering Knowledge from Hypertext Data" is the first book devoted entirely to techniques for producing knowledge from the vast body of unstructured Web data. Building on an initial survey of infrastructural issues - including Web crawling and indexing - Chakrabarti examines low-level machine learning techniques as they relate specifically to the challenges of Web mining. He then devotes the final part of the book to applications that unite infrastructure and analysis to bring machine learning to bear on systematically acquired and stored data. Here the focus is on results: the strengths and weaknesses of these applications, along with their potential as foundations for further progress. From Chakrabarti's painstaking, critical, and forward-looking work, readers will gain the theoretical and practical understanding they need to contribute to the Web mining effort. Features: a comprehensive, critical exploration of statistics-based attempts to make sense of Web mining; details of the special challenges associated with analyzing unstructured and semi-structured data; a look at how classical Information Retrieval techniques have been modified for use with Web data; a focus on today's dominant learning methods: clustering and classification, hyperlink analysis, and supervised and semi-supervised learning; an analysis of current applications for resource discovery and social network analysis; and an excellent way to introduce students to especially vital applications of data mining and machine learning technology.
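"Mind Maps in the Classroom: A Mind-Mapping Study Method for Secondary School Students" (《课堂上的思维导图:中学生思维导图学习法》). Synopsis: Dr. Sun Yixin (孙易新), a distinguished Chinese instructor at the Buzan Centre in the UK, distills more than 20 years of experience applying the mind-mapping method, specifically for...
"MATLAB Neural Network Application Design" (《MATLAB神经网络应用设计》) uses the internationally popular MATLAB environment together with the Neural Network Toolbox and, while giving an accessible introduction to the various typical networks among artificial neural networks...
Python is an interpreted, object-oriented, dynamically typed high-level programming language. Since its birth in the early 1990s, it has gradually come to be widely used for system administration tasks...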
Your expert guide to implementing Scalable Vector Graphics (SVG). Dive in - and quickly learn ho...
Focusing on three principal systems - GPS, GALILEO, and GLONASS - this practical resource provi...
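"The Scholar-Craftsman: A Biography of Cheng Taining" (《儒匠——程泰宁传》). Synopsis: He was obsessed with martial-arts novels, yet stumbled by chance into the halls of architecture; he is the first, and to date the only, person to be included by a well-known foreign publishing house in...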
Richard A. Clarke warned America once before about the havoc terrorism would wreak on our natio...
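"What to Eat When You Have Ovarian Cancer" (《生了卵巢癌,怎么吃》). Synopsis: Drawing on Professor He Yumin's (何裕民) more than 40 years of experience with anticancer diets, gained from treating over 50,000 cancer patients, and combining it with our own 20-plus years of work in oncology and dietary nutrition...
"Leave the Child to Dad" (《把孩子交给爸爸》). Synopsis: In family education today, absent or insufficient involvement by fathers is widespread. The author, himself a thoroughly capable father, offers millions of...
"Computer Science and Combinatorics Series: Computer Cryptography: Data Confidentiality and Security in Computer Networks (3rd Edition)" (《计算机科学组合学丛书·计算机密码学:计算机网络中的数据保密与安全(第3版)》) builds on the 2nd edition and, taking into account the development of cryptographic technology in recent years...
"The Practice of Law (Expanded Edition)" (《法学实践(增订版)》). Synopsis: How exactly is law to be practiced? This book discusses that question, focusing mainly on the Chinese context while also encompassing broader theoretical...
"Log Management and Analysis (2nd Edition)" (《日志管理与分析(第2版)》). Synopsis: Based on the design principles of mainstream log management and analysis systems, this book thoroughly and completely covers the principles and implementation of each process module in log analysis...
This book is a collection of technical articles written by more than 40 of the most outstanding programmers in the game development industry abroad. Each article addresses a specific problem in game programming, not only providing an approach to solving it...
"A Practical Journey in Website Design with HTML and CSS" (《HTML与CSS网站设计实践之旅》) starts from basic web page knowledge and explains in detail the entire process of creating a website, covering the choice of website development tools, an introduction to the basic elements of a web page...
"Disassembly, Assembly, and Inspection of Hybrid Electric Vehicles" (《混合动力汽车拆装与检测》). Synopsis: This book is developed using a work-process-based approach and organizes its content around typical work tasks, mainly including an introduction to hybrid electric vehicles...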
Learn JavaScript and jQuery a nicer way. This full-color book adopts a visual approach to teachi...
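Imagine you are climbing a mountain called "software development." This book offers the words and pictures of an agile pioneer who is climbing the same mountain as you. Beside the rugged mountain path he has found a fairly level...
"The Internet of Things: A Powerful Tool for Digitizing Everything" (《物联网:万物数字化的利器》). Synopsis: This is a technical monograph on the Internet of Things ecosystem. Starting from an analysis of world economic cycles, the book introduces the sixth "long economic...
By reading this book, you will be able to: understand the life cycle of OS X and iOS applications; design adaptive interfaces with storyboards; explore the graphics system, including the built-in 2D and 3D game frameworks; use AVFounda...
Yang Shuyun (杨树云) is a renowned Chinese makeup artist. Known for creating complete period looks from ancient times, he has long been called "the Number One Comb Under Heaven" for his rich practical experience, solid theoretical foundation, and deep cultural grounding...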