Supervised Learning: Mathematical Foundations and Real-World Applications
暫譯: 監督式學習:數學基礎與實際應用
Chakrabarty, Dalia
- 出版商: CRC
- 出版日期: 2025-03-17
- 售價: $5,550
- 貴賓價: 9.5 折 $5,273
- 語言: 英文
- 頁數: 344
- 裝訂: Hardcover - also called cloth, retail trade, or trade
- ISBN: 1032283300
- ISBN-13: 9781032283302
海外代購書籍(需單獨結帳)
商品描述
This book discusses the relevance of probabilistic supervised learning, to the pursuit of automated and reliable prediction of an unknown that is in a state of relationship with another variable. The book provides methods for secured mechanistic learning of the function that represents this relationship between the output and input variables, where said learning is undertaken within the remit of real-world information that can be messy in different ways. For example, the available data may be highly multivariate or be high-dimensional, reflecting the nature of the output variable that could be a vector, or matrix, or even higher in dimension, as is often the case in a real-world application. Additionally, the data is noisy, and often it is small to moderately large in size in multiple applications. Another difficulty that regularly arises is that the training dataset - comprising pairs of values of the input and output -is such, that the sought function cannot be captured by a parametric shape, but is instead underlined by an inhomogeneous correlation structure. These difficulties notwithstanding, we desire a streamlined methodology that allows the learning of the inter variable relationship - to ultimately permit fast and reliable predictions of the output, at newly recorded values of the input. In fact, occasions arise when one seeks values of the input at which a new output value is recorded, and such a demand is also addressed in the book.
The generic solution to the problem of secured supervised learning amidst real-world messiness, lies in treating the sought inter-variable relation as a (function-valued) random variable, which, being random, is ascribed a probability distribution. Then recalling that distributions on the space of functions are given by stochastic processes, the sought function is proposed to be a sample function of a stochastic process. This process is chosen as one that imposes minimal constraints on the sought function - identified as a Gaussian Process (GP) in the book. Thus, the sought function can be inferred upon, as long as the co-variance function of the underlying GP is learnt, given the available training set. The book presents probabilistic techniques to undertake said learning, within the challenges borne by the data, and illustrates such techniques on real data. Learning of a function is always followed by closed-form prediction of the mean and dispersion of the output variable that is realised at a test input.
To help with the background, the book includes reviews on stochastic processes and basic probability theory. This will render the first half of the book useful for students across disciplines, while the latter half will be appreciated by students of numerate subjects at the postgraduate level or higher, including students of computational sciences, statistics and mathematics.
商品描述(中文翻譯)
這本書討論了機率監督學習在自動化和可靠預測一個與另一變數有關的未知數的相關性。書中提供了安全的機械學習方法,用於表示輸入變數和輸出變數之間關係的函數,這種學習是在現實世界中進行的,這些資訊可能以不同的方式雜亂無章。例如,現有的數據可能是高度多變量的,或是高維的,反映了輸出變數的性質,該變數可能是向量、矩陣,甚至更高維度,這在現實應用中經常發生。此外,數據是嘈雜的,並且在多個應用中,數據的大小通常是小到中等的。另一個經常出現的困難是,訓練數據集——由輸入和輸出值的對組成——的特性使得所尋求的函數無法用參數形狀捕捉,而是由不均勻的相關結構所主導。儘管存在這些困難,我們仍然希望有一種簡化的方法論,能夠學習變數之間的關係,最終允許在新記錄的輸入值下快速且可靠地預測輸出。事實上,會出現尋求輸入值的情況,以便記錄新的輸出值,這種需求在書中也有討論。
在現實世界的雜亂中,安全監督學習問題的通用解決方案在於將所尋求的變數間關係視為一個(函數值)隨機變數,這個隨機變數被賦予一個機率分佈。然後回想起在函數空間上的分佈是由隨機過程給出的,所尋求的函數被提出為隨機過程的一個樣本函數。這個過程被選擇為對所尋求的函數施加最小約束的過程——在書中被識別為高斯過程(Gaussian Process, GP)。因此,只要學會了基礎GP的協方差函數,就可以對所尋求的函數進行推斷,前提是有可用的訓練集。書中展示了在數據挑戰下進行上述學習的機率技術,並在真實數據上說明了這些技術。函數的學習總是隨之而來的是對在測試輸入下實現的輸出變數的均值和離散度的封閉形式預測。
為了幫助讀者理解背景,書中包括了對隨機過程和基本機率論的回顧。這將使書的前半部分對各學科的學生有用,而後半部分則會受到研究生或更高層次的數學、統計和計算科學等數量科目學生的欣賞。
作者簡介
Dr. Dalia Chakrabarty is a Reader in Statistical Data Science in the Department of Mathematics at the University of York. Her PhD is from St. Cross College in the University of Oxford, and she works on the development of methods to permit the probabilistic learning of random variables of various kinds, given real world data that is diversely challenging.
作者簡介(中文翻譯)
達莉亞·查克拉巴提博士是約克大學數學系的統計數據科學講師。她的博士學位來自牛津大學聖克羅斯學院,專注於開發方法,以便在面對各種現實世界中多樣且具挑戰性的數據時,進行隨機變量的概率學習。