Data-intensive Systems: Principles and Fundamentals using Hadoop and Spark (Advanced Information and Knowledge Processing)
暫譯: 數據密集型系統:使用 Hadoop 和 Spark 的原則與基礎(高級資訊與知識處理)
Tomasz Wiktorski
相關主題
商品描述
Data-intensive systems are a technological building block supporting Big Data and Data Science applications.This book familiarizes readers with core concepts that they should be aware of before continuing with independent work and the more advanced technical reference literature that dominates the current landscape.
The material in the book is structured following a problem-based approach. This means that the content in the chapters is focused on developing solutions to simplified, but still realistic problems using data-intensive technologies and approaches. The reader follows one reference scenario through the whole book, that uses an open Apache dataset.
The origins of this volume are in lectures from a master’s course in Data-intensive Systems, given at the University of Stavanger. Some chapters were also a base for guest lectures at Purdue University and Lodz University of Technology.
商品描述(中文翻譯)
數據密集型系統是支撐大數據和數據科學應用的技術基石。本書使讀者熟悉在進行獨立工作和深入研究當前主流的技術參考文獻之前,應該了解的核心概念。
本書的內容結構採用基於問題的方法。這意味著各章節的內容專注於使用數據密集型技術和方法來解決簡化但仍然現實的問題。讀者在整本書中跟隨一個參考場景,該場景使用了一個開放的 Apache 數據集。
本書的起源來自於斯塔萬格大學的數據密集型系統碩士課程的講座。有些章節也作為普渡大學和羅茲科技大學的客座講座的基礎。