Machine Learning Pocket Reference: Working with Structured Data in Python
Provisional Chinese title: 機器學習口袋參考指南

Harrison, Matt

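  • Publisher: O'Reilly
  • Publication date: 2019-10-08
  • List price: $880
  • Sale price: $836 (5% off)
  • Language: English
  • Pages: 200
  • Binding: Trade paperback (quality paper)
  • ISBN: 1492047546
  • ISBN-13: 9781492047544
  • Related category: Machine Learning
  • Related translation: 機器學習常用算法速查手冊 (Simplified Chinese edition)
  • Availability: ships immediately (1 in stock)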

Product Description

With detailed notes, tables, and examples, this handy reference will help you navigate the basics of structured machine learning. Author Matt Harrison delivers a valuable guide that you can use for additional support during training and as a convenient resource when you dive into your next machine learning project.

Ideal for programmers, data scientists, and AI engineers, this book includes an overview of the machine learning process and walks you through classification with structured data. You'll also learn methods for clustering, predicting a continuous value (regression), and reducing dimensionality, among other topics.

This pocket reference includes sections that cover:

  • Classification, using the Titanic dataset
  • Cleaning data and dealing with missing data
  • Exploratory data analysis
  • Common preprocessing steps using sample data
  • Selecting features useful to the model
  • Model selection
  • Metrics and classification evaluation
  • Regression examples using k-nearest neighbor, decision trees, boosting, and more
  • Metrics for regression evaluation
  • Clustering
  • Dimensionality reduction
  • Scikit-learn pipelines

About the Author

Matt runs MetaSnake, a Python and Data Science training and consulting company. He has over 15 years of experience using Python across a breadth of domains: Data Science, BI, Storage, Testing and Automation, Open Source Stack Management, and Search.
