商品描述
This book offers a comprehensive resource on Solid-State Drives (SSD) as the field undergoes a radical evolution characterized by the incredible variety of SSD forms and their rapid diversification. It proposes a new classification system to help readers navigate the SSD landscape. For years, the evolution of SSDs was obscured by the unchanging abstractions of block devices and POSIX I/O, but it is apparent that these abstractions have become a problematic hinderance to performance and also fail to reduce software complexity. The book explores how such a state of affairs impacts the database community in at least two ways. First, it considers how using SSDs through legacy interfaces that hide internal mechanisms invariably results in erratic performance. While the blame often goes to the notoriously expensive garbage collection of SSDs, the authors argue that in truth, several other complex processes result in nonlinear effects on latency and bandwidth. The book describes these processes and how they are implemented in modern devices, knowledge that will help system designers better choose SSDs and shape database workloads to match their performance characteristics. Second, the book explores how the inadequacy of the traditional I/O abstractions opens up an entire research field focused on the co-design of database management systems and SSD. Such research aims at devising mechanisms and policies coupling the storage manager of database and SSD internals, e.g., placing an SSD FTL under the control of database, changing SSD sub-systems in response to the workload, or executing logic within an SSD on a database's behalf. The book introduces these principles of DBMS/SSD co-design and argues that a more seamless integration of databases and storage solutions as well as the study of SSD variations adapted to database computations are central to the development of the next generation of database systems.
商品描述(中文翻譯)
本書提供了一個關於固態硬碟(SSD)的全面資源,因為該領域正經歷著一場激進的演變,特徵是SSD形式的驚人多樣性及其快速的多元化。它提出了一個新的分類系統,以幫助讀者導航SSD的生態系統。多年來,SSD的演變被區塊設備和POSIX I/O的不變抽象所掩蓋,但顯然這些抽象已成為性能的問題性障礙,並且未能減少軟體的複雜性。本書探討了這種情況對資料庫社群的影響,至少有兩個方面。首先,它考慮到通過隱藏內部機制的舊有介面使用SSD,必然會導致不穩定的性能。雖然責任常常歸咎於SSD著名的高成本垃圾回收,但作者主張,事實上,還有幾個其他複雜的過程會對延遲和帶寬產生非線性影響。本書描述了這些過程及其在現代設備中的實現,這些知識將幫助系統設計師更好地選擇SSD並調整資料庫工作負載以匹配其性能特徵。其次,本書探討了傳統I/O抽象的不足如何開啟一個專注於資料庫管理系統和SSD共同設計的全新研究領域。這類研究旨在設計機制和政策,將資料庫的儲存管理器與SSD內部結合,例如,將SSD的FTL置於資料庫的控制之下,根據工作負載變更SSD子系統,或在SSD內部代表資料庫執行邏輯。本書介紹了DBMS/SSD共同設計的原則,並主張資料庫與儲存解決方案的更無縫整合,以及針對資料庫計算調整的SSD變體的研究,對於下一代資料庫系統的發展至關重要。
作者簡介
Alberto Lerner has a mixed profile of industrial and academic work with over thirty years of experience. He worked as research or software technical staff at numerous tech companies, such as IBM, Google, and MongoDB and as a consultant at many database start-ups. His interest revolves around high-scale, high-performance distributed systems, particularly using heterogeneous hardware to support them. He has participated in designing and implementing several such systems, including, more recently, the X-SSD device and ongoing efforts to create more easily programmable co-designed devices. Alberto has been a Senior Researcher at the Computer Science Department of the University of Fribourg in Switzerland since 2018. He has been on several Program Committees for the Database and Systems communities, including SIGMOD, VLDB, CIDR, EDBT, ICDE, and Usenix ATC.
Philippe Bonnet is an experimental computer scientist with a background in database systems. For thirty years, Philippe has explored the design, implementation, and evaluation of database systems in the context of successive generations of computer classes, including wireless sensor networks, computer clusters, and most recently disaggregated heterogeneous computers. Philippe is an expert on storage system software. He contributed to the uFlip Benchmark, the Linux multiqueue block layer, the Linux framework for Open-Channel SSDs, the OX architecture for computational storage, the xNVMe library, and Delilah, a prototype for eBPF offload on computational storage. Philippe has been a professor in the department of Computer Science at the IT University of Copenhagen since 2016. He is a trustee of the VLDB Endowment and currently chairs the ACM EIG on Reproducibility and Replicability.
作者簡介(中文翻譯)
阿爾貝托·勒納(Alberto Lerner)擁有工業與學術工作的混合背景,擁有超過三十年的經驗。他曾在多家科技公司擔任研究或軟體技術人員,如 IBM、Google 和 MongoDB,並在許多資料庫初創公司擔任顧問。他的興趣集中在高規模、高效能的分散式系統,特別是使用異質硬體來支援這些系統。他參與設計和實現了幾個此類系統,包括最近的 X-SSD 裝置以及持續努力創建更易於編程的共同設計裝置。自 2018 年以來,阿爾貝托一直擔任瑞士弗里堡大學計算機科學系的高級研究員。他曾參與多個資料庫和系統社群的程序委員會,包括 SIGMOD、VLDB、CIDR、EDBT、ICDE 和 Usenix ATC。
菲利普·博內(Philippe Bonnet)是一位實驗性計算機科學家,擁有資料庫系統的背景。在過去三十年中,菲利普探索了資料庫系統的設計、實現和評估,涵蓋了不同世代的計算機類別,包括無線感測器網路、計算機叢集,以及最近的解耦異質計算機。菲利普是儲存系統軟體的專家。他參與了 uFlip Benchmark、Linux 多佇列區塊層、Linux 的 Open-Channel SSD 框架、計算儲存的 OX 架構、xNVMe 函式庫,以及 Delilah,這是一個針對計算儲存的 eBPF 卸載原型。自 2016 年以來,菲利普一直擔任哥本哈根 IT 大學計算機科學系的教授。他是 VLDB 基金會的受託人,並目前擔任 ACM EIG 在可重現性和可複製性方面的主席。