Introduction to Information Retrieval (English edition), PDF e-book download
- Authors: Christopher D. Manning (USA), Prabhakar Raghavan (USA), Hinrich Schütze (Germany)
- Publisher: Beijing: Posts & Telecom Press (人民邮电出版社)
- Year of publication: 2010
- ISBN: 9787115218247
- Pages: 482
1 Boolean retrieval
1.1 An example information retrieval problem
1.2 A first take at building an inverted index
1.3 Processing Boolean queries
1.4 The extended Boolean model versus ranked retrieval
1.5 References and further reading
2 The term vocabulary and postings lists
2.1 Document delineation and character sequence decoding
2.2 Determining the vocabulary of terms
2.3 Faster postings list intersection via skip pointers
2.4 Positional postings and phrase queries
2.5 References and further reading
3 Dictionaries and tolerant retrieval
3.1 Search structures for dictionaries
3.2 Wildcard queries
3.3 Spelling correction
3.4 Phonetic correction
3.5 References and further reading
4 Index construction
4.1 Hardware basics
4.2 Blocked sort-based indexing
4.3 Single-pass in-memory indexing
4.4 Distributed indexing
4.5 Dynamic indexing
4.6 Other types of indexes
4.7 References and further reading
5 Index compression
5.1 Statistical properties of terms in information retrieval
5.2 Dictionary compression
5.3 Postings file compression
5.4 References and further reading
6 Scoring, term weighting, and the vector space model
6.1 Parametric and zone indexes
6.2 Term frequency and weighting
6.3 The vector space model for scoring
6.4 Variant tf-idf functions
6.5 References and further reading
7 Computing scores in a complete search system
7.1 Efficient scoring and ranking
7.2 Components of an information retrieval system
7.3 Vector space scoring and query operator interaction
7.4 References and further reading
8 Evaluation in information retrieval
8.1 Information retrieval system evaluation
8.2 Standard test collections
8.3 Evaluation of unranked retrieval sets
8.4 Evaluation of ranked retrieval results
8.5 Assessing relevance
8.6 A broader perspective: System quality and user utility
8.7 Results snippets
8.8 References and further reading
9 Relevance feedback and query expansion
9.1 Relevance feedback and pseudo relevance feedback
9.2 Global methods for query reformulation
9.3 References and further reading
10 XML retrieval
10.1 Basic XML concepts
10.2 Challenges in XML retrieval
10.3 A vector space model for XML retrieval
10.4 Evaluation of XML retrieval
10.5 Text-centric versus data-centric XML retrieval
10.6 References and further reading
11 Probabilistic information retrieval
11.1 Review of basic probability theory
11.2 The probability ranking principle
11.3 The binary independence model
11.4 An appraisal and some extensions
11.5 References and further reading
12 Language models for information retrieval
12.1 Language models
12.2 The query likelihood model
12.3 Language modeling versus other approaches in information retrieval
12.4 Extended language modeling approaches
12.5 References and further reading
13 Text classification and Naive Bayes
13.1 The text classification problem
13.2 Naive Bayes text classification
13.3 The Bernoulli model
13.4 Properties of Naive Bayes
13.5 Feature selection
13.6 Evaluation of text classification
13.7 References and further reading
14 Vector space classification
14.1 Document representations and measures of relatedness in vector spaces
14.2 Rocchio classification
14.3 k nearest neighbor
14.4 Linear versus nonlinear classifiers
14.5 Classification with more than two classes
14.6 The bias-variance tradeoff
14.7 References and further reading
15 Support vector machines and machine learning on documents
15.1 Support vector machines: The linearly separable case
15.2 Extensions to the support vector machine model
15.3 Issues in the classification of text documents
15.4 Machine-learning methods in ad hoc information retrieval
15.5 References and further reading
16 Flat clustering
16.1 Clustering in information retrieval
16.2 Problem statement
16.3 Evaluation of clustering
16.4 K-means
16.5 Model-based clustering
16.6 References and further reading
17 Hierarchical clustering
17.1 Hierarchical agglomerative clustering
17.2 Single-link and complete-link clustering
17.3 Group-average agglomerative clustering
17.4 Centroid clustering
17.5 Optimality of hierarchical agglomerative clustering
17.6 Divisive clustering
17.7 Cluster labeling
17.8 Implementation notes
17.9 References and further reading
18 Matrix decompositions and latent semantic indexing
18.1 Linear algebra review
18.2 Term-document matrices and singular value decompositions
18.3 Low-rank approximations
18.4 Latent semantic indexing
18.5 References and further reading
19 Web search basics
19.1 Background and history
19.2 Web characteristics
19.3 Advertising as the economic model
19.4 The search user experience
19.5 Index size and estimation
19.6 Near-duplicates and shingling
19.7 References and further reading
20 Web crawling and indexes
20.1 Overview
20.2 Crawling
20.3 Distributing indexes
20.4 Connectivity servers
20.5 References and further reading
21 Link analysis
21.1 The Web as a graph
21.2 PageRank
21.3 Hubs and authorities
21.4 References and further reading