Mastering Prometheus: Gain expert tips to monitoring your infrastructure, applications, and services
Hegedus, William
- Publisher: Packt Publishing
- Publication date: 2024-04-19
- List price: $1,890
- VIP price: 5% off, $1,796
- Language: English
- Pages: 310
- Binding: Quality paper (also called trade paper)
- ISBN: 1805125664
- ISBN-13: 9781805125662
Ships immediately (stock: 1)
Product description
Learn how to effectively implement, manage, and optimize Prometheus for monitoring your systems
Key Features
- Achieve high availability with Prometheus by using Thanos
- Integrate Prometheus into your broader observability stack with OpenTelemetry
- Tweak, tune, and debug Prometheus to reliably scale without limits
- Purchase of the print or Kindle book includes a free PDF eBook
Book Description
With an increased focus on observability and reliability, establishing a scalable and reliable monitoring environment is more important than ever. Over the last decade, Prometheus has emerged as the leading open-source, time-series-based monitoring software catering to this demand. This book is your guide to scaling, operating, and extending Prometheus, from small on-premises workloads to globally distributed multi-cloud workloads and everything in between.
Starting with an introduction to Prometheus and its role in observability, the book provides a walkthrough of its deployment. You'll explore Prometheus's query language and TSDB data model, followed by dynamic service discovery for monitoring targets and refining alerting through custom templates and formatting. The book then demonstrates horizontal scaling of Prometheus via sharding and federation, while equipping you with debugging techniques and strategies to fine-tune data ingestion. Advancing through the chapters, you'll manage Prometheus at scale through CI validations and templating with Jsonnet, and integrate Prometheus with other projects such as OpenTelemetry, Thanos, VictoriaMetrics, and Mimir.
By the end of this book, you'll have practical knowledge of Prometheus and its ecosystem, which will help you discern when, why, and how to scale it to meet your ever-growing needs.
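As a rough illustration of the sharding idea mentioned in the description, the sketch below splits a list of scrape targets across several Prometheus servers by hashing each target address and taking the result modulo the shard count. In Prometheus itself this is usually configured through relabeling with the `hashmod` action. The Python here is only a language-neutral illustration under assumptions: the function name `shard_for` and the target addresses are hypothetical, and the hash is not claimed to match the one Prometheus computes.

```python
import hashlib


def shard_for(target: str, num_shards: int) -> int:
    """Return a stable shard index in [0, num_shards) for a target address.

    Hypothetical helper for illustration; not Prometheus's own hashing.
    """
    if num_shards < 1:
        raise ValueError("num_shards must be at least 1")
    digest = hashlib.md5(target.encode("utf-8")).digest()
    # Treat the digest as one large unsigned integer and reduce it.
    return int.from_bytes(digest, "big") % num_shards


# Hypothetical Node Exporter targets, split across two Prometheus shards.
targets = ["node-a:9100", "node-b:9100", "node-c:9100", "node-d:9100"]
assignment = {t: shard_for(t, 2) for t in targets}

for target, shard in sorted(assignment.items()):
    print(f"{target} -> shard {shard}")
```

Because the hash depends only on the target address, every shard can evaluate the same rule independently and keep just the targets whose index matches its own.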
What you will learn
- Deploy Prometheus and Node Exporter to public clouds and Kubernetes
- Gain in-depth knowledge of how Prometheus's underlying code works
- Build your own custom service-discovery providers for Prometheus
- Debug Prometheus performance issues to identify cardinality issues in your environment
- Use VictoriaMetrics and/or Grafana Mimir for remote storage of Prometheus data
- Define and implement SLO-based alerting
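To give a feel for what SLO-based alerting involves, here is a minimal sketch of the burn-rate arithmetic commonly used for it. It is an assumption-laden illustration and not the book's own method: the function names are hypothetical, and the default threshold of 14.4 is the widely cited value that corresponds to spending 2% of a 30-day error budget within one hour.

```python
def error_budget(slo_target: float) -> float:
    """Fraction of requests allowed to fail, e.g. about 0.001 for a 99.9% SLO."""
    return 1.0 - slo_target


def burn_rate(error_ratio: float, slo_target: float) -> float:
    """How many times faster than 'exactly on budget' errors are occurring."""
    return error_ratio / error_budget(slo_target)


def should_page(short_ratio: float, long_ratio: float,
                slo_target: float, threshold: float = 14.4) -> bool:
    """Page only when both a short and a long window exceed the threshold.

    Requiring both windows keeps brief spikes from paging, and lets the
    alert clear quickly once the short window recovers.
    """
    return (burn_rate(short_ratio, slo_target) > threshold
            and burn_rate(long_ratio, slo_target) > threshold)


# Hypothetical observed error ratios over a short and a long window.
print(should_page(short_ratio=0.02, long_ratio=0.02, slo_target=0.999))
print(should_page(short_ratio=0.02, long_ratio=0.001, slo_target=0.999))
```

In a real deployment, the error ratios would come from recording rules evaluated by Prometheus instead of hard-coded numbers.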
Who this book is for
This book is for site reliability engineers (SREs), developers, and platform engineers involved in monitoring and observability for their team's or company's systems. A background in Prometheus is assumed, so the book spends minimal time on the basics of getting Prometheus up and running. Whether you aim to expand monitoring capabilities, streamline configuration management, or enhance integration with existing tools, this book will help you maximize the potential of your Prometheus monitoring stack.
Table of Contents
- Observability, Monitoring, and Prometheus
- Deploying Prometheus
- The Prometheus Data Model and PromQL
- Using Service Discovery
- Effective Alerting with Prometheus
- Advancing Prometheus: Sharding, Federation, and HA
- Optimizing and Debugging Prometheus
- Enabling Systems Monitoring with the Node Exporter
- Utilizing Remote Storage Systems with Prometheus
- Extending Prometheus Globally with Thanos
- Jsonnet and Monitoring Mixins
- Utilizing Continuous Integration (CI) Pipelines with Prometheus
- Defining and Alerting on SLOs
- Integrating OpenTelemetry with Prometheus
- Beyond Prometheus