Bigquery for Data Warehousing: Managed Data Analysis in the Google Cloud
暫譯: BigQuery 數據倉儲:在 Google Cloud 中的管理數據分析
Mucchetti, Mark
相關主題
商品描述
Create a data warehouse, complete with reporting and dashboards using Google's BigQuery technology. This book takes you from the basic concepts of data warehousing through the design, build, load, and maintenance phases. You will build capabilities to capture data from the operational environment, and then mine and analyze that data for insight into making your business more successful. You will gain practical knowledge about how to use BigQuery to solve data challenges in your organization.
BigQuery is a managed cloud platform from Google that provides enterprise data warehousing and reporting capabilities. Part I of this book shows you how to design and provision a data warehouse in the BigQuery platform. Part II teaches you how to load and stream your operational data into the warehouse to make it ready for analysis and reporting. Parts III and IV cover querying and maintaining, helping you keep your information relevant with other Google Cloud Platform services and advanced BigQuery. Part V takes reporting to the next level by showing you how to create dashboards to provide at-a-glance visual representations of your business situation. Part VI provides an introduction to data science with BigQuery, covering machine learning and Jupyter notebooks.
What You Will Learn
- Design a data warehouse for your project or organization
- Load data from a variety of external and internal sources
- Integrate other Google Cloud Platform services for more complex workflows
- Maintain and scale your data warehouse as your organization grows
- Analyze, report, and create dashboards on the information in the warehouse
- Become familiar with machine learning techniques using BigQuery ML
Who This Book Is For
Developers who want to provide business users with fast, reliable, and insightful analysis from operational data, and data analysts interested in a cloud-based solution that avoids the pain of provisioning their own servers.
商品描述(中文翻譯)
使用 Google 的 BigQuery 技術創建一個完整的數據倉庫,並配備報告和儀表板。本書將帶您從數據倉庫的基本概念開始,經過設計、構建、加載和維護階段。您將建立從操作環境捕獲數據的能力,然後挖掘和分析這些數據,以獲得有助於提升業務成功的見解。您將獲得有關如何使用 BigQuery 解決組織內數據挑戰的實用知識。
BigQuery 是 Google 提供的一個管理型雲平台,提供企業級數據倉庫和報告功能。本書的第一部分將向您展示如何在 BigQuery 平台上設計和配置數據倉庫。第二部分教您如何將操作數據加載和串流到數據倉庫中,以便為分析和報告做好準備。第三和第四部分涵蓋查詢和維護,幫助您保持信息與其他 Google Cloud Platform 服務和高級 BigQuery 的相關性。第五部分通過展示如何創建儀表板,將報告提升到一個新水平,以提供業務狀況的快速視覺表示。第六部分介紹了使用 BigQuery 的數據科學,涵蓋機器學習和 Jupyter notebooks。
您將學到什麼
- 為您的項目或組織設計數據倉庫
- 從各種外部和內部來源加載數據
- 整合其他 Google Cloud Platform 服務以實現更複雜的工作流程
- 隨著組織的增長,維護和擴展您的數據倉庫
- 分析、報告並創建倉庫中信息的儀表板
- 熟悉使用 BigQuery ML 的機器學習技術
本書適合誰
希望為業務用戶提供快速、可靠且具洞察力的操作數據分析的開發人員,以及對雲端解決方案感興趣的數據分析師,這種解決方案避免了配置自己伺服器的麻煩。
作者簡介
Mark Mucchetti is an industry technology leader in healthcare and ecommerce. He has been working with computers and writing software for over 30 years, starting with BASIC and Turbo C on an Intel 8088 and now using Node.js in the cloud. He has been building and managing technology groups for much of that time, combining his deep love of technical topics with his management skills to create world-class platforms. Mark has also worked in databases, release engineering, front- and back-end coding, and project management. He believes that the best decisions are made with the best data available, and that BigQuery is a great technology to increase the value and accessibility of data for business leaders on a day-to-day basis. He has seen the transformation that accurate, timely data has on an organization's ability to succeed, and wants to bring that knowledge to the world in a people-first way.
作者簡介(中文翻譯)
Mark Mucchetti 是醫療保健和電子商務領域的技術領導者。他在電腦和軟體開發方面已有超過 30 年的經驗,最初使用 Intel 8088 上的 BASIC 和 Turbo C,現在則在雲端使用 Node.js。他在這段時間內一直在建立和管理技術團隊,將他對技術主題的深厚熱愛與管理技能結合,創造出世界級的平台。Mark 也曾從事資料庫、發佈工程、前端和後端編碼以及專案管理。他相信最佳的決策是基於最佳的數據,並且認為 BigQuery 是一項出色的技術,可以在日常運營中提高商業領導者對數據的價值和可及性。他見證了準確、及時的數據對組織成功能力的轉變,並希望以以人為本的方式將這些知識帶給世界。