Data Science on the Google Cloud Platform: Implementing End-to-End Real-Time Data Pipelines: From Ingest to Machine Learning
暫譯: 在 Google Cloud Platform 上的資料科學:實作端到端即時資料管道:從資料擷取到機器學習
Valliappa Lakshmanan
- 出版商: O'Reilly
- 出版日期: 2018-01-23
- 定價: $2,180
- 售價: 5.0 折 $1,090
- 語言: 英文
- 頁數: 410
- 裝訂: Paperback
- ISBN: 1491974567
- ISBN-13: 9781491974568
-
相關分類:
Google Cloud、Machine Learning、Data Science
-
相關翻譯:
基於雲計算的數據科學 (簡中版)
-
其他版本:
Data Science on the Google Cloud Platform: Implementing End-To-End Real-Time Data Pipelines: From Ingest to Machine Learning, 2/e (Paperback)
買這商品的人也買了...
-
$950$903 -
$1,020$969 -
$1,020$969 -
$640$608 -
$1,020$969 -
$520$442 -
$520$442 -
$520$442 -
$505圖解Spark:核心技術與案例實戰
-
$580$458 -
$520$442 -
$990Python for Data Analysis: Data Wrangling with Pandas, NumPy, and IPython, 2/e (Paperback)
-
$403AWS Lambda 實戰 : 開發事件驅動的無服務器應用程序 (AWS Lambda in Action: Event-Driven Serverless Applications)
-
$254亞馬遜 AWS 雲基礎與實戰
-
$1,690$1,606 -
$474$450 -
$352關聯數據:萬維網上的結構化數據
-
$680$578 -
$419$398 -
$505機器學習即服務:將 Python 機器學習創意快速轉變為雲端 Web 應用程序 (Monetizing Machine Learning: Quickly Turn Python ML Ideas into Web Applications on the Serverless Cloud)
-
$1,568Machine Learning in the AWS Cloud: Add Intelligence to Applications with Amazon SageMaker and Amazon Rekognition
-
$454Python 3反爬蟲原理與繞過實戰
-
$653AWS 高級網絡官方學習指南 (專項領域) (AWS Certified Advanced Networking Official Study Guide: Specialty Exam)
-
$414$393 -
$539$512
相關主題
商品描述
Learn how easy it is to apply sophisticated statistical and machine learning methods to real-world problems when you build on top of the Google Cloud Platform (GCP). This hands-on guide shows developers entering the data science field how to implement an end-to-end data pipeline, using statistical and machine learning methods and tools on GCP. Through the course of the book, you’ll work through a sample business decision by employing a variety of data science approaches.
Follow along by implementing these statistical and machine learning solutions in your own project on GCP, and discover how this platform provides a transformative and more collaborative way of doing data science.
You’ll learn how to:
- Automate and schedule data ingest, using an App Engine application
- Create and populate a dashboard in Google Data Studio
- Build a real-time analysis pipeline to carry out streaming analytics
- Conduct interactive data exploration with Google BigQuery
- Create a Bayesian model on a Cloud Dataproc cluster
- Build a logistic regression machine-learning model with Spark
- Compute time-aggregate features with a Cloud Dataflow pipeline
- Create a high-performing prediction model with TensorFlow
- Use your deployed model as a microservice you can access from both batch and real-time pipelines
商品描述(中文翻譯)
學習如何輕鬆地將複雜的統計和機器學習方法應用於現實世界的問題,當你在 Google Cloud Platform (GCP) 上進行開發時。本手冊將指導進入資料科學領域的開發者如何實現端到端的資料管道,使用 GCP 上的統計和機器學習方法及工具。在本書的過程中,你將通過採用各種資料科學方法來處理一個範例商業決策。
透過在 GCP 上實施這些統計和機器學習解決方案,跟隨本書的步驟,並發現這個平台如何提供一種變革性且更具協作性的資料科學方式。
你將學會如何:
- 使用 App Engine 應用程式自動化和排程資料攝取
- 在 Google Data Studio 中創建和填充儀表板
- 建立實時分析管道以進行串流分析
- 使用 Google BigQuery 進行互動式資料探索
- 在 Cloud Dataproc 叢集上創建貝葉斯模型
- 使用 Spark 建立邏輯回歸機器學習模型
- 使用 Cloud Dataflow 管道計算時間聚合特徵
- 使用 TensorFlow 創建高效能的預測模型
- 將已部署的模型作為微服務,從批次和實時管道中訪問