Disk-Based Algorithms for Big Data(Hardcover)
暫譯: 基於磁碟的大數據演算法(精裝版)

Christopher G. Healey

  • 出版商: CRC
  • 出版日期: 2016-12-07
  • 售價: $2,990
  • 貴賓價: 9.5$2,841
  • 語言: 英文
  • 頁數: 208
  • 裝訂: Hardcover
  • ISBN: 1138196185
  • ISBN-13: 9781138196186
  • 相關分類: 大數據 Big-dataAlgorithms-data-structures
  • 海外代購書籍(需單獨結帳)

商品描述

Disk-Based Algorithms for Big Data is a product of recent advances in the areas of big data, data analytics, and the underlying file systems and data management algorithms used to support the storage and analysis of massive data collections. The book discusses hard disks and their impact on data management, since Hard Disk Drives continue to be common in large data clusters. It also explores ways to store and retrieve data though primary and secondary indices. This includes a review of different in-memory sorting and searching algorithms that build a foundation for more sophisticated on-disk approaches like mergesort, B-trees, and extendible hashing.

Following this introduction, the book transitions to more recent topics, including advanced storage technologies like solid-state drives and holographic storage; peer-to-peer (P2P) communication; large file systems and query languages like Hadoop/HDFS, Hive, Cassandra, and Presto; and NoSQL databases like Neo4j for graph structures and MongoDB for unstructured document data.

Designed for senior undergraduate and graduate students, as well as professionals, this book is useful for anyone interested in understanding the foundations and advances in big data storage and management, and big data analytics.

About the Author

Dr. Christopher G. Healey is a tenured Professor in the Department of Computer Science and the Goodnight Distinguished Professor of Analytics in the Institute for Advanced Analytics, both at North Carolina State University in Raleigh, North Carolina. He has published over 50 articles in major journals and conferences in the areas of visualization, visual and data analytics, computer graphics, and artificial intelligence. He is a recipient of the National Science Foundation’s CAREER Early Faculty Development Award and the North Carolina State University Outstanding Instructor Award. He is a Senior Member of the Association for Computing Machinery (ACM) and the Institute of Electrical and Electronics Engineers (IEEE), and an Associate Editor of ACM Transaction on Applied Perception, the leading worldwide journal on the application of human perception to issues in computer science.

商品描述(中文翻譯)

《基於磁碟的大數據演算法》是近期在大數據、數據分析以及支援大量數據集合儲存和分析的底層檔案系統和數據管理演算法領域的最新進展的產物。本書討論了硬碟及其對數據管理的影響,因為硬碟驅動器在大型數據集群中仍然是常見的存儲設備。它還探討了通過主要和次要索引來儲存和檢索數據的方法。這包括對不同的內存排序和搜尋演算法的回顧,這些演算法為更複雜的磁碟上方法(如合併排序、B樹和可擴展哈希)奠定了基礎。

在這個介紹之後,本書轉向更近期的主題,包括先進的儲存技術,如固態硬碟和全息儲存;點對點(P2P)通信;大型檔案系統和查詢語言,如Hadoop/HDFS、Hive、Cassandra和Presto;以及NoSQL數據庫,如用於圖形結構的Neo4j和用於非結構化文檔數據的MongoDB。

本書旨在為高年級本科生、研究生以及專業人士提供幫助,對於任何有興趣了解大數據儲存和管理的基礎及進展,以及大數據分析的人士都非常有用。

關於作者

Christopher G. Healey博士是北卡羅來納州立大學計算機科學系的終身教授及高級分析研究所的Goodnight傑出分析教授。他在可視化、視覺和數據分析、計算機圖形學和人工智慧等領域的主要期刊和會議上發表了超過50篇文章。他是美國國家科學基金會的CAREER早期教職發展獎和北卡羅來納州立大學傑出講師獎的獲得者。他是計算機協會(ACM)和電氣與電子工程師學會(IEEE)的高級會員,並且是ACM應用感知期刊的副編輯,該期刊是全球在計算機科學中應用人類感知問題的領先期刊。