Statistical Inference and Machine Learning for Big Data
暫譯: 大數據的統計推斷與機器學習
Alvo, Mayer
- 出版商: Springer
- 出版日期: 2022-12-01
- 售價: $5,910
- 貴賓價: 9.5 折 $5,615
- 語言: 英文
- 頁數: 463
- 裝訂: Hardcover - also called cloth, retail trade, or trade
- ISBN: 3031067835
- ISBN-13: 9783031067839
-
相關分類:
大數據 Big-data、Machine Learning
海外代購書籍(需單獨結帳)
商品描述
This book initially presents a variety of advanced statistical methods at a level suitable for advanced undergraduate and graduate students as well as others interested in familiarizing themselves with this important subject. Later, it proceeds to illustrate these methods in the context of real life applications. The non specialist seldom gets to see the main focus of modern statistics. Through the presentation of several real life applications in a variety of areas such as genetics and environmental problems one begins to gain an appreciation of the challenges and the utility of statistics.
The book begins in Part I by outlining various data types and by indicating how these are normally represented graphically and subsequently analyzed. In Part II, Chapters 2 and 3 we introduce the basic tools in probability and statistics. Here, we have retained the most useful and relevant results pertinent to this book. In Chapter 4, we proceed with an introduction to multivariate methods and to copula methods. We illustrate a number of applications by presenting real life examples. In Chapter 5 we introduce nonparametric methods which are particularly useful in the analysis of BIG DATA when the underlying distributions are often unknown. Some emphasis is placed on the use of ranking methods. We continue with a discussion of exponential tilting and its applications in Chapter 6. There we discuss the subject of empirical Bayes and its application to micro-array data. In Chapter 7, we touch on counting data analysis and survival analysis. In Chapter 8, time series methods are briefly described both from the usual classical as well as from the state space modeling approaches. Estimating equations and empirical likelihood are discussed in Chapter 9. We present their application in nonparametric testing. Symbolic data analysis is a relatively new field which aims to reduce the dimension of the data through a process of aggregation. It forms the subject of Chapter 10 wherein traditional statistical methods are applied to aggregated medical data. In Part III we focus first on the subject of regression through the lens of machine learning. In Chapter 11 we describe regression methods from the machine learning point of view along with support vector machines often used to study interactions and classification. We then continue in Chapter 12 with the important topics of neural networks and text analytics. We conclude with Part IV by presenting the computational aspects of BIG DATA with special attention devoted to Markov Chain Monte Carlo methods and to Bayesian nonparametric statistics.
This book was written for two key audiences. It would serve as a handy desk reference for statistical methods at the undergraduate and graduate level. It would also be useful in courses which aim to provide an overview of modern statistics and its applications.
商品描述(中文翻譯)
這本書最初以適合高年級本科生和研究生的水平介紹各種先進的統計方法,以及其他有興趣熟悉這一重要主題的人士。接著,它將這些方法應用於現實生活中的實例。非專業人士很少能看到現代統計的主要焦點。通過在遺傳學和環境問題等多個領域展示幾個現實生活中的應用,讀者開始理解統計的挑戰和實用性。
本書在第一部分開始時概述了各種數據類型,並指出這些數據通常是如何以圖形方式表示和隨後分析的。在第二部分的第二章和第三章中,我們介紹了概率和統計的基本工具。在這裡,我們保留了與本書相關的最有用和最重要的結果。在第四章中,我們介紹了多變量方法和聯合方法。我們通過展示現實生活中的例子來說明多個應用。在第五章中,我們介紹了非參數方法,這在分析大數據時特別有用,因為其潛在的分佈通常是未知的。我們特別強調了排名方法的使用。在第六章中,我們繼續討論指數傾斜及其應用。在那裡,我們討論了經驗貝葉斯及其在微陣列數據中的應用。在第七章中,我們簡要介紹了計數數據分析和生存分析。在第八章中,時間序列方法從傳統的經典方法和狀態空間建模方法兩個角度進行簡要描述。在第九章中,我們討論了估計方程和經驗似然,並展示了它們在非參數檢驗中的應用。符號數據分析是一個相對較新的領域,旨在通過聚合過程減少數據的維度。這是第十章的主題,其中傳統統計方法應用於聚合的醫療數據。在第三部分中,我們首先從機器學習的角度聚焦於回歸主題。在第十一章中,我們從機器學習的角度描述回歸方法,並介紹了常用於研究交互作用和分類的支持向量機。然後在第十二章中,我們繼續討論神經網絡和文本分析的重要主題。我們在第四部分結束時,介紹了大數據的計算方面,特別關注馬爾可夫鏈蒙特卡羅方法和貝葉斯非參數統計。
這本書是為兩個主要受眾而寫的。它將作為本科生和研究生統計方法的便捷參考書,也將在旨在提供現代統計及其應用概述的課程中發揮作用。