Feature Engineering Bookcamp
暫譯: 特徵工程實戰營
Ozdemir, Sinan
- 出版商: Manning
- 出版日期: 2022-11-15
- 定價: $2,100
- 售價: 8.8 折 $1,848 (限時優惠至 2025-03-31)
- 語言: 英文
- 頁數: 272
- 裝訂: Quality Paper - also called trade paper
- ISBN: 1617299790
- ISBN-13: 9781617299797
-
相關分類:
大數據 Big-data、Machine Learning、Data Science
-
相關翻譯:
特徵工程訓練營 (簡中版)
立即出貨 (庫存=1)
買這商品的人也買了...
-
$2,800$2,660 -
$2,700$2,565 -
$834$792 -
$1,421Fundamentals of Machine Learning for Predictive Data Analytics : Algorithms, Worked Examples, and Case Studies, 2/e (Hardcover)
-
$1,980$1,881
商品描述
Deliver huge improvements to your machine learning pipelines without spending hours fine-tuning parameters! This book's practical case-studies reveal feature engineering techniques that upgrade your data wrangling--and your ML results. In Feature Engineering Bookcamp you will learn how to: Identify and implement feature transformations for your data
Build powerful machine learning pipelines with unstructured data like text and images
Quantify and minimize bias in machine learning pipelines at the data level
Use feature stores to build real-time feature engineering pipelines
Enhance existing machine learning pipelines by manipulating the input data
Use state-of-the-art deep learning models to extract hidden patterns in data Feature Engineering Bookcamp guides you through a collection of projects that give you hands-on practice with core feature engineering techniques. You'll work with feature engineering practices that speed up the time it takes to process data and deliver real improvements in your model's performance. This instantly-useful book skips the abstract mathematical theory and minutely-detailed formulas; instead you'll learn through interesting code-driven case studies, including tweet classification, COVID detection, recidivism prediction, stock price movement detection, and more. Purchase of the print book includes a free eBook in PDF, Kindle, and ePub formats from Manning Publications. About the technology
Get better output from machine learning pipelines by improving your training data! Use feature engineering, a machine learning technique for designing relevant input variables based on your existing data, to simplify training and enhance model performance. While fine-tuning hyperparameters or tweaking models may give you a minor performance bump, feature engineering delivers dramatic improvements by transforming your data pipeline. About the book
Feature Engineering Bookcamp walks you through six hands-on projects where you'll learn to upgrade your training data using feature engineering. Each chapter explores a new code-driven case study, taken from real-world industries like finance and healthcare. You'll practice cleaning and transforming data, mitigating bias, and more. The book is full of performance-enhancing tips for all major ML subdomains--from natural language processing to time-series analysis. What's inside Identify and implement feature transformations
Build machine learning pipelines with unstructured data
Quantify and minimize bias in ML pipelines
Use feature stores to build real-time feature engineering pipelines
Enhance existing pipelines by manipulating input data About the reader
For experienced machine learning engineers familiar with Python. About the author
Sinan Ozdemir is the founder and CTO of Shiba, a former lecturer of Data Science at Johns Hopkins University, and the author of multiple textbooks on data science and machine learning. Table of Contents
1 Introduction to feature engineering
2 The basics of feature engineering
3 Healthcare: Diagnosing COVID-19
4 Bias and fairness: Modeling recidivism
5 Natural language processing: Classifying social media sentiment
6 Computer vision: Object recognition
7 Time series analysis: Day trading with machine learning
8 Feature stores
9 Putting it all together
Build powerful machine learning pipelines with unstructured data like text and images
Quantify and minimize bias in machine learning pipelines at the data level
Use feature stores to build real-time feature engineering pipelines
Enhance existing machine learning pipelines by manipulating the input data
Use state-of-the-art deep learning models to extract hidden patterns in data Feature Engineering Bookcamp guides you through a collection of projects that give you hands-on practice with core feature engineering techniques. You'll work with feature engineering practices that speed up the time it takes to process data and deliver real improvements in your model's performance. This instantly-useful book skips the abstract mathematical theory and minutely-detailed formulas; instead you'll learn through interesting code-driven case studies, including tweet classification, COVID detection, recidivism prediction, stock price movement detection, and more. Purchase of the print book includes a free eBook in PDF, Kindle, and ePub formats from Manning Publications. About the technology
Get better output from machine learning pipelines by improving your training data! Use feature engineering, a machine learning technique for designing relevant input variables based on your existing data, to simplify training and enhance model performance. While fine-tuning hyperparameters or tweaking models may give you a minor performance bump, feature engineering delivers dramatic improvements by transforming your data pipeline. About the book
Feature Engineering Bookcamp walks you through six hands-on projects where you'll learn to upgrade your training data using feature engineering. Each chapter explores a new code-driven case study, taken from real-world industries like finance and healthcare. You'll practice cleaning and transforming data, mitigating bias, and more. The book is full of performance-enhancing tips for all major ML subdomains--from natural language processing to time-series analysis. What's inside Identify and implement feature transformations
Build machine learning pipelines with unstructured data
Quantify and minimize bias in ML pipelines
Use feature stores to build real-time feature engineering pipelines
Enhance existing pipelines by manipulating input data About the reader
For experienced machine learning engineers familiar with Python. About the author
Sinan Ozdemir is the founder and CTO of Shiba, a former lecturer of Data Science at Johns Hopkins University, and the author of multiple textbooks on data science and machine learning. Table of Contents
1 Introduction to feature engineering
2 The basics of feature engineering
3 Healthcare: Diagnosing COVID-19
4 Bias and fairness: Modeling recidivism
5 Natural language processing: Classifying social media sentiment
6 Computer vision: Object recognition
7 Time series analysis: Day trading with machine learning
8 Feature stores
9 Putting it all together
商品描述(中文翻譯)
在不花費數小時微調參數的情況下,為您的機器學習管道帶來巨大的改進!本書的實用案例研究揭示了提升數據處理和機器學習結果的特徵工程技術。
在 特徵工程實戰營 中,您將學習如何: 確定並實施數據的特徵轉換使用非結構化數據(如文本和圖像)構建強大的機器學習管道
在數據層面量化並最小化機器學習管道中的偏見
使用特徵存儲構建實時特徵工程管道
通過操作輸入數據來增強現有的機器學習管道
使用最先進的深度學習模型提取數據中的隱藏模式 特徵工程實戰營 引導您完成一系列項目,讓您親手實踐核心的特徵工程技術。您將學習加速數據處理的特徵工程實踐,並在模型性能上實現真正的改進。本書跳過抽象的數學理論和詳細的公式;相反,您將通過有趣的代碼驅動案例研究來學習,包括推文分類、COVID檢測、再犯預測、股價變動檢測等。 購買印刷版書籍可獲得 Manning Publications 提供的免費電子書,格式包括 PDF、Kindle 和 ePub。 關於技術
通過改善訓練數據來獲得更好的機器學習管道輸出!使用特徵工程,這是一種基於現有數據設計相關輸入變量的機器學習技術,以簡化訓練並增強模型性能。雖然微調超參數或調整模型可能會帶來輕微的性能提升,但特徵工程通過轉換數據管道提供了顯著的改進。 關於本書
特徵工程實戰營 帶您完成六個實踐項目,您將學習如何使用特徵工程升級您的訓練數據。每一章都探討一個新的代碼驅動案例研究,這些案例來自金融和醫療等真實行業。您將練習清理和轉換數據、減輕偏見等。本書充滿了針對所有主要機器學習子領域的性能提升技巧,從自然語言處理到時間序列分析。 內容概覽 確定並實施特徵轉換
使用非結構化數據構建機器學習管道
在機器學習管道中量化並最小化偏見
使用特徵存儲構建實時特徵工程管道
通過操作輸入數據增強現有管道 讀者對象
適合熟悉 Python 的經驗豐富的機器學習工程師。 作者介紹
Sinan Ozdemir 是 Shiba 的創始人兼首席技術官,曾任約翰霍普金斯大學數據科學講師,並著有多本數據科學和機器學習的教科書。 目錄
1 特徵工程介紹
2 特徵工程基礎
3 醫療保健:診斷 COVID-19
4 偏見與公平:建模再犯
5 自然語言處理:分類社交媒體情感
6 計算機視覺:物體識別
7 時間序列分析:使用機器學習進行日內交易
8 特徵存儲
9 整合所有內容