Data Analysis in Bi-Partial Perspective: Clustering and Beyond
暫譯: 雙向視角下的數據分析:聚類及其他方法
Owsiński, Jan W.
- 出版商: Springer
- 出版日期: 2020-08-14
- 售價: $4,510
- 貴賓價: 9.5 折 $4,285
- 語言: 英文
- 頁數: 153
- 裝訂: Quality Paper - also called trade paper
- ISBN: 3030133915
- ISBN-13: 9783030133917
-
相關分類:
Data Science
海外代購書籍(需單獨結帳)
相關主題
商品描述
This book presents the bi-partial approach to data analysis, which is both uniquely general and enables the development of techniques for many data analysis problems, including related models and algorithms. It is based on adequate representation of the essential clustering problem: to group together the similar, and to separate the dissimilar. This leads to a general objective function and subsequently to a broad class of concrete implementations. Using this basis, a suboptimising procedure can be developed, together with a variety of implementations.
This procedure has a striking affinity with the classical hierarchical merger algorithms, while also incorporating the stopping rule, based on the objective function. The approach resolves the cluster number issue, as the solutions obtained include both the content and the number of clusters. Further, it is demonstrated how the bi-partial principle can be effectively applied to a wide variety of problems in data analysis.The book offers a valuable resource for all data scientists who wish to broaden their perspective on basic approaches and essential problems, and to thus find answers to questions that are often overlooked or have yet to be solved convincingly. It is also intended for graduate students in the computer and data sciences, and will complement their knowledge and skills with fresh insights on problems that are otherwise treated in the standard "academic" manner.商品描述(中文翻譯)
這本書介紹了數據分析的雙部分方法,這種方法既具有獨特的通用性,又能夠為許多數據分析問題開發技術,包括相關的模型和算法。它基於對基本聚類問題的充分表述:將相似的事物分組在一起,並將不相似的事物分開。這導致了一個通用的目標函數,隨後形成了一個廣泛的具體實現類別。基於這一基礎,可以開發出一種次優化程序,並提供多種實現方式。
這一程序與經典的層次合併算法有著顯著的相似性,同時也納入了基於目標函數的停止規則。這種方法解決了聚類數量的問題,因為所獲得的解決方案同時包括了聚類的內容和數量。此外,還展示了如何有效地將雙部分原則應用於各種數據分析問題。
這本書為所有希望擴展對基本方法和基本問題的視角的數據科學家提供了寶貴的資源,從而找到那些經常被忽視或尚未得到令人信服的解決的問題的答案。它也適合計算機和數據科學的研究生,並將為他們的知識和技能提供新的見解,針對那些通常以標準「學術」方式處理的問題。