Automated Taxonomy Discovery and Exploration
暫譯: 自動化分類法發現與探索

Name: Automated Taxonomy Discovery and Exploration
Price: 2451 TWD
Availability: OnlineOnly
Author: Shen, Jiaming, Han, Jiawei
ISBN: 3031114078

Shen, Jiaming, Han, Jiawei

出版商: Springer
出版日期: 2023-10-02
售價: $2,580
貴賓價: 9.5 折 $2,451
語言: 英文
頁數: 103
裝訂: Quality Paper - also called trade paper
ISBN: 3031114078
ISBN-13: 9783031114076
相關分類: 大數據 Big-data、Machine Learning、Data Science

海外代購書籍(需單獨結帳)

商品描述

This book provides a principled data-driven framework that progressively constructs, enriches, and applies taxonomies without leveraging massive human annotated data. Traditionally, people construct domain-specific taxonomies by extensive manual curations, which is time-consuming and costly. In today's information era, people are inundated with the vast amounts of text data. Despite their usefulness, people haven't yet exploited the full power of taxonomies due to the heavy curation needed for creating and maintaining them. To bridge this gap, the authors discuss automated taxonomy discovery and exploration, with an emphasis on label-efficient machine learning methods and their real-world usages. Taxonomy organizes entities and concepts in a hierarchy way. It is ubiquitous in our daily life, ranging from product taxonomies used by online retailers, topic taxonomies deployed by news outlets and social media, as well as scientific taxonomies deployed by digital libraries across various domains. When properly analyzed, these taxonomies can play a vital role for science, engineering, business intelligence, policy design, ecommerce, and more. Intuitive examples are used throughout enabling readers to grasp concepts more easily.

In addition, this book:

Discusses the process of creating, maintaining, and applying taxonomies via simple, easy-to-understand examples
Provides a systematic review of the current research frontier of each task and discusses their real-world applications
Includes supporting materials containing links to commonly used evaluation datasets and a code repository of representative algorithms

商品描述(中文翻譯)

本書提供了一個原則性、數據驅動的框架，逐步構建、豐富並應用分類法，而不依賴大量人工標註的數據。傳統上，人們通過廣泛的手動整理來構建特定領域的分類法，這既耗時又昂貴。在當今的信息時代，人們面臨著大量文本數據的轟炸。儘管分類法非常有用，但由於創建和維護它們所需的繁重整理工作，人們尚未充分發揮分類法的全部潛力。為了彌補這一差距，作者討論了自動化的分類法發現和探索，重點介紹了標籤高效的機器學習方法及其在現實世界中的應用。分類法以層級方式組織實體和概念。它在我們的日常生活中無處不在，從在線零售商使用的產品分類法、新聞媒體和社交媒體部署的主題分類法，到各個領域的數字圖書館使用的科學分類法。當這些分類法得到妥善分析時，能在科學、工程、商業智能、政策設計、電子商務等方面發揮重要作用。本書中使用直觀的例子，幫助讀者更容易理解概念。

此外，本書還：
- 討論了通過簡單易懂的例子創建、維護和應用分類法的過程
- 提供了對每個任務當前研究前沿的系統回顧，並討論其現實世界的應用
- 包含支持材料，提供常用評估數據集的鏈接和代表性算法的代碼庫

作者簡介

Jiaming Shen, Ph.D., is a Research Scientist at Google Research working on data mining and natural language processing. His research aims to develop automated methods for mining knowledge from text data without excessive human annotations. He completed his Ph.D. from the University of Illinois at Urbana-Champaign and a B.S. degree from Shanghai Jiao Tong University. His research has been awarded several fellowships and scholarships, including a Brian Totty Graduate Fellowship and a Yunni & Maxine Pao Memorial Fellowship.
Jiawei Han, Ph.D. is a Michael Aiken Chair Professor at the University of Illinois at Urbana-Champaign. His research areas encompass data mining, text mining, data warehousing, and information network analysis, with over 800 research publications. He is a Fellow of both ACM and the IEEE and has received numerous prominent awards, including the ACM SIGKDD Innovation Award (2004) and the IEEE Computer Society W. Wallace McDowell Award (2009).

作者簡介(中文翻譯)

沈家明博士是Google Research的研究科學家，專注於資料探勘和自然語言處理。他的研究目標是開發自動化方法，從文本數據中挖掘知識，而不需要過多的人為標註。他在伊利諾伊大學香檳分校獲得博士學位，並在上海交通大學獲得學士學位。他的研究曾獲得多項獎學金和獎助金，包括Brian Totty研究生獎學金和Yunni & Maxine Pao紀念獎學金。

韓家偉博士是伊利諾伊大學香檳分校的Michael Aiken講座教授。他的研究領域包括資料探勘、文本探勘、資料倉儲和資訊網路分析，擁有超過800篇研究出版物。他是ACM和IEEE的會士，並獲得多項重要獎項，包括ACM SIGKDD創新獎（2004年）和IEEE計算機學會W. Wallace McDowell獎（2009年）。