Python Feature Engineering Cookbook

Name: Python Feature Engineering Cookbook
Price: 1311 TWD
Availability: InStock
Author: Soledad Galli
ISBN: 1789806313

Soledad Galli

出版商: Packt Publishing
出版日期: 2020-01-22
售價: $1,380
貴賓價: 9.5 折 $1,311
語言: 英文
頁數: 372
裝訂: Quality Paper - also called trade paper
ISBN: 1789806313
ISBN-13: 9781789806311
相關分類: Python、程式語言

立即出貨 (庫存=1)

買這商品的人也買了...

~~$3,500~~ $3,325

Pattern Recognition and Machine Learning (Hardcover)
$2,993

The Elements of Statistical Learning: Data Mining, Inference, and Prediction, 2/e (Hardcover)
~~$2,980~~ $2,831

Learning Python, 5/e (Paperback)
~~$2,800~~ $2,660

Pro Oracle SQL, 2/e (Paperback)
$1,680

An Introduction to Statistical Learning: With Applications in R (Hardcover)
~~$2,020~~ $1,919

R for Data Science: Import, Tidy, Transform, Visualize, and Model Data (Paperback)
$1,617

Deep Learning (Hardcover)
$990

Python for Data Analysis: Data Wrangling with Pandas, NumPy, and IPython, 2/e (Paperback)
$1,332

Feature Engineering Made Easy
~~$1,750~~ $1,715

Reinforcement Learning: An Introduction, 2/e (Hardcover)
~~$2,670~~ $2,537

Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow, 2/e (Paperback)
~~$4,810~~ $4,570

Exploratory Data Analysis (Classic Version)
~~$2,530~~ $2,404

Advanced R, 2/e (Paperback)
~~$834~~ $792

數據挖掘導論, 2/e (Introduction to Data Mining, 2/e)
~~$3,320~~ $3,154

Feature Engineering and Selection: A Practical Approach for Predictive Models (Hardcover)
~~$970~~ $922

The Wealth of Nations
~~$2,100~~ $1,995

Python Machine Learning: Machine Learning and Deep Learning with Python, scikit-learn, and TensorFlow 2, 3/e (Paperback)
~~$1,710~~ $1,625

Business Data Science: Combining Machine Learning and Economics to Optimize, Automate, and Accelerate Business Decisions
$2,288

Deep Reinforcement Learning Hands-On, 2/e (Paperback)
~~$2,204~~ $2,088

Software Engineering at Google: Lessons Learned from Programming Over Time (Paperback)
$1,421

Fundamentals of Machine Learning for Predictive Data Analytics : Algorithms, Worked Examples, and Case Studies, 2/e (Hardcover)
~~$1,980~~ $1,881

Generative AI with LangChain: Build large language model (LLM) apps with Python, ChatGPT and other LLMs (Paperback)

商品描述

Key Features

Discover solutions for feature generation, feature extraction, and feature selection
Uncover the end-to-end feature engineering process across continuous, discrete, and unstructured datasets
Implement modern feature extraction techniques using Python's pandas, scikit-learn, SciPy and NumPy libraries

Book Description

Feature engineering is invaluable for developing and enriching your machine learning models. In this cookbook, you will work with the best tools to streamline your feature engineering pipelines and techniques and simplify and improve the quality of your code.

Using Python libraries such as pandas, scikit-learn, Featuretools, and Feature-engine, you'll learn how to work with both continuous and discrete datasets and be able to transform features from unstructured datasets. You will develop the skills necessary to select the best features as well as the most suitable extraction techniques. This book will cover Python recipes that will help you automate feature engineering to simplify complex processes. You'll also get to grips with different feature engineering strategies, such as the box-cox transform, power transform, and log transform across machine learning, reinforcement learning, and natural language processing (NLP) domains.

By the end of this book, you'll have discovered tips and practical solutions to all of your feature engineering problems.

What you will learn

Simplify your feature engineering pipelines with powerful Python packages
Get to grips with imputing missing values
Encode categorical variables with a wide set of techniques
Extract insights from text quickly and effortlessly
Develop features from transactional data and time series data
Derive new features by combining existing variables
Understand how to transform, discretize, and scale your variables
Create informative variables from date and time

Who this book is for

This book is for machine learning professionals, AI engineers, data scientists, and NLP and reinforcement learning engineers who want to optimize and enrich their machine learning models with the best features. Knowledge of machine learning and Python coding will assist you with understanding the concepts covered in this book.

商品描述(中文翻譯)

主要特點

發現特徵生成、特徵提取和特徵選擇的解決方案

了解連續、離散和非結構化數據集的端到端特徵工程過程

使用Python的pandas、scikit-learn、SciPy和NumPy庫實現現代特徵提取技術

書籍描述

特徵工程對於開發和豐富機器學習模型至關重要。在這本食譜中，您將使用最佳工具來簡化特徵工程流程和技術，簡化和改進代碼的質量。

使用Python庫，如pandas、scikit-learn、Featuretools和Feature-engine，您將學習如何處理連續和離散數據集，並能夠從非結構化數據集中轉換特徵。您將開發選擇最佳特徵以及最適合的提取技術所需的技能。本書將涵蓋幫助您自動化特徵工程以簡化複雜流程的Python食譜。您還將掌握不同的特徵工程策略，例如box-cox轉換、power轉換和log轉換，涵蓋機器學習、強化學習和自然語言處理（NLP）領域。

通過閱讀本書，您將發現解決所有特徵工程問題的技巧和實用解決方案。

您將學到什麼

使用強大的Python包簡化特徵工程流程

掌握填補缺失值的方法

使用多種技術對分類變量進行編碼

快速輕鬆地從文本中提取洞察力

從交易數據和時間序列數據中開發特徵

通過結合現有變量來衍生新特徵

了解如何轉換、離散化和縮放變量

從日期和時間中創建有信息量的變量

本書適合對象

本書適合機器學習專業人士、AI工程師、數據科學家以及NLP和強化學習工程師，他們希望通過最佳特徵優化和豐富他們的機器學習模型。機器學習和Python編程的知識將有助於您理解本書中涵蓋的概念。

作者簡介

Soledad Galli is a lead data scientist with more than 10 years of experience in world-class academic institutions and renowned businesses. She has researched, developed, and put into production machine learning models for insurance claims, credit risk assessment, and fraud prevention. Soledad received a Data Science Leaders' award in 2018 and was named one of LinkedIn's voices in data science and analytics in 2019. She is passionate about enabling people to step into and excel in data science, which is why she mentors data scientists and speaks at data science meetings regularly. She also teaches online courses on machine learning in a prestigious Massive Open Online Course platform, which have reached more than 10,000 students worldwide.

作者簡介(中文翻譯)

Soledad Galli 是一位領先的資料科學家，擁有超過10年的經驗，曾在世界一流的學術機構和知名企業工作。她研究、開發並將機器學習模型應用於保險理賠、信用風險評估和詐騙預防。Soledad在2018年獲得了資料科學領袖獎，並在2019年被列為LinkedIn資料科學和分析領域的聲音之一。她熱衷於幫助人們進入並在資料科學領域取得卓越成就，因此她定期指導資料科學家並在資料科學會議上演講。她還在一個知名的大規模開放式網路課程平台上教授機器學習的線上課程，已經吸引了全球超過10,000名學生。