Small Summaries for Big Data
暫譯: 大數據的小摘要
Cormode, Graham, Yi, Ke
- 出版商: Cambridge
- 出版日期: 2020-12-10
- 售價: $2,490
- 貴賓價: 9.5 折 $2,366
- 語言: 英文
- 頁數: 278
- 裝訂: Hardcover - also called cloth, retail trade, or trade
- ISBN: 1108477445
- ISBN-13: 9781108477444
-
相關分類:
大數據 Big-data
海外代購書籍(需單獨結帳)
相關主題
商品描述
The massive volume of data generated in modern applications can overwhelm our ability to conveniently transmit, store, and index it. For many scenarios, building a compact summary of a dataset that is vastly smaller enables flexibility and efficiency in a range of queries over the data, in exchange for some approximation. This comprehensive introduction to data summarization, aimed at practitioners and students, showcases the algorithms, their behavior, and the mathematical underpinnings of their operation. The coverage starts with simple sums and approximate counts, building to more advanced probabilistic structures such as the Bloom Filter, distinct value summaries, sketches, and quantile summaries. Summaries are described for specific types of data, such as geometric data, graphs, and vectors and matrices. The authors offer detailed descriptions of and pseudocode for key algorithms that have been incorporated in systems from companies such as Google, Apple, Microsoft, Netflix and Twitter.
商品描述(中文翻譯)
現代應用程式產生的大量數據可能會超過我們方便傳輸、儲存和索引的能力。在許多情況下,建立一個大幅縮小的數據集摘要可以在某種程度上提供靈活性和效率,以便對數據進行各種查詢,這是以某些近似為交換的。這本針對實務工作者和學生的全面介紹數據摘要的書籍,展示了算法、它們的行為以及其運作的數學基礎。內容從簡單的總和和近似計數開始,逐步深入到更高級的概率結構,如 Bloom Filter、獨特值摘要、草圖和分位數摘要。針對特定類型的數據(如幾何數據、圖形以及向量和矩陣)描述了摘要。作者提供了詳細的關鍵算法描述和偽代碼,這些算法已被 Google、Apple、Microsoft、Netflix 和 Twitter 等公司的系統所採用。