
最 低 价:¥57.80
| “……我购买了这本书。这本书绝对是非常值得拥有的参考书。” ——l. venkata subramaniam,ibm印度研究实验室 “一本由该领域最重要专家编写的文本挖掘导论。这本书写得非常好,完美地结合了文本挖掘的理论和实践,既适合研究人员又适合实践者……极力推荐那些没有任何计算语言学背景而想钻研文本挖掘领域的人阅读本书。” ——rada mihalcea,北得克萨斯大学 |
| Ronen Feldman 机器学习、数据挖掘和非结构化数据管理的先驱人物。以色列Bar-Ilan大学数学与计算机科学系高级讲师、数据挖掘实验室主任,Clearforest公司(主要为企业和政府机构开发下一代文本挖掘应用)合作创始人、董事长,现在还是纽约大学Stern商学院的副教授。 |
| ⅰ. introduction to text mining ⅰ.1 defining text mining ⅰ.2 general architecture of text mining systems ⅱ. core text mining operations ⅱ.1 core text mining operations ⅱ.2 using background knowledge for text mining ⅱ.3 text mining query languages ⅲ. text mining preprocessing techniques ⅲ.1 task-oriented approaches ⅲ.2 further reading ⅳ. categorization ⅳ.1 applications of text categorization ⅳ.2 definition of the problem ⅳ.3 document representation ⅳ.4 knowledge engineering approach to tc ⅳ.5 machine learning approach to tc ⅳ.6 using unlabeled data to improve classification ⅳ.7 evaluation of text classifiers ⅳ.8 citations and notes ⅴ. clustering ⅴ.1 clustering tasks in text analysis ⅴ.2 the general clustering problem ⅴ.3 clustering algorithms ⅴ.4 clustering of textual data ⅴ.5 citations and notes ⅵ. information extraction ⅵ.1 introduction to information extraction ⅵ.2 historical evolution of ie: the message understanding conferences and tipster ⅵ.3 ie examples ⅵ.4 architecture of ie systems ⅵ.5 anaphora resolution ⅵ.6 inductive algorithms for ie ⅵ.7 structural ie ⅵ.8 further reading ⅶ. probabilistic models for information extraction ⅶ.1 hidden markov models ⅶ.2 stochastic context-free grammars ⅶ.3 maximal entropy modeling ⅶ.4 maximal entropy markov models ⅶ.5 conditional random fields ⅶ.6 further reading ⅷ. preprocessing applications using probabilistic and hybrid approaches ⅷ.1 applications of hmm to textual analysis ⅷ.2 using memm for information extraction ⅷ.3 applications of crfs to textual analysis ⅷ.4 teg: using scfg rules for hybrid statistical–knowledge-based ie ⅷ.5 bootstrapping ⅷ.6 further reading ⅸ. presentation-layer considerations for browsing and query refinement ⅸ.1 browsing ⅸ.2 accessing constraints and simple specification filters at the presentation layer ⅸ.3 accessing the underlying query language ⅸ.4 citations and notes ⅹ. visualization approaches ⅹ.1 introduction ⅹ.2 architectural considerations ⅹ.3 common visualization approaches for text mining ⅹ.4 visualization techniques in link analysis ⅹ.5 real-world example: the document explorer system ?. link analysis ?.1 preliminaries ?.2 automatic layout of networks ?.3 paths and cycles in graphs ?.4 centrality ?.5 partitioning of networks ?.6 pattern matching in networks ?.7 software packages for link analysis ?.8 citations and notes ?. text mining applications ?.1 general considerations ?.2 corporate finance: mining industry literature for business intelligence ?.3 a “horizontal” text mining application: patent analysis solution leveraging a commercial text analytics platform ?.4 life sciences research: mining biological pathway information with geneways appendix a: dial: a dedicated information extraction language fortext mining a.1 what is the dial language? a.2 information extraction in the dial environment a.3 text tokenization a.4 concept and rule structure a.5 pattern matching a.6 pattern elements a.7 rule constraints a.8 concept guards a.9 complete dial examples bibliography index |
商品评论(0条)