Data Wrangling with SQL: A hands-on guide to manipulating, wrangling, and engineering data using SQL

Kandarpa, Raghav, Saxena, Shivangi

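  • Publisher: Packt Publishing
  • Publication date: 2023-07-31
  • List price: $1,600
  • Member price: $1,520 (5% off)
  • Language: English
  • Pages: 350
  • Binding: Quality Paper - also called trade paper
  • ISBN: 183763002X
  • ISBN-13: 9781837630028
  • Related category: SQL
  • Overseas special-order title (requires separate checkout)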

Product Description

Become a data wrangling expert and make well-informed decisions by effectively utilizing and analyzing raw unstructured data in a systematic manner

Purchase of the print or Kindle book includes a free PDF eBook

Key Features

  • Implement query optimization during data wrangling using the SQL language with practical use cases
  • Master data cleaning, handle date functions and NULL values, and write subqueries and window functions
  • Practice self-assessment questions for SQL-based interviews and real-world case study rounds
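As a rough illustration of two of the techniques named above (NULL handling and window functions), here is a minimal sketch that runs SQL through Python's built-in `sqlite3` module. It is not taken from the book: the `sales` table, its columns, and the `running_totals` helper are hypothetical, and the query assumes an SQLite build with window-function support (3.25 or later).

```python
import sqlite3


def running_totals(rows):
    """Return (day, amount, running_total) for a list of (day, amount) rows.

    The table and column names are hypothetical, chosen only for this sketch.
    """
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE sales (day TEXT, amount REAL)")
    conn.executemany("INSERT INTO sales VALUES (?, ?)", rows)

    query = """
        SELECT day,
               COALESCE(amount, 0.0) AS amount,            -- replace NULL with 0
               SUM(COALESCE(amount, 0.0))
                   OVER (ORDER BY day) AS running_total     -- window function
        FROM sales
        ORDER BY day
    """
    result = conn.execute(query).fetchall()
    conn.close()
    return result


if __name__ == "__main__":
    sample = [("2023-01-01", 10.0), ("2023-01-02", None), ("2023-01-03", 5.0)]
    for row in running_totals(sample):
        print(row)
```

Treating a missing amount as zero is only one possible policy; depending on the data, excluding the row or carrying the previous value forward may be more appropriate.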

Book Description

The amount of data generated continues to grow rapidly, making it increasingly important for businesses to be able to wrangle this data and understand it quickly and efficiently. Although data wrangling can be challenging, with the right tools and techniques you can efficiently handle enormous amounts of unstructured data.

The book starts by introducing you to the basics of SQL, focusing on the core principles and techniques of data wrangling. You’ll then explore advanced SQL concepts like aggregate functions, window functions, CTEs, and subqueries that are very popular in the business world. The next set of chapters will walk you through the functions within a SQL query that cause delays in data transformation and help you tell a good query from a bad one. You’ll also learn how data wrangling and data science go hand in hand. The book is filled with datasets and practical examples to help you understand the concepts thoroughly, along with best practices to guide you at every stage of data wrangling.

By the end of this book, you’ll be equipped with essential techniques and best practices for data wrangling, and you’ll know how to use clean, standardized data models to make informed decisions and help businesses avoid costly mistakes.

What you will learn

  • Build time series models using data wrangling
  • Discover data wrangling best practices as well as tips and tricks
  • Find out how to use subqueries, window functions, CTEs, and aggregate functions
  • Handle missing data, data types, date formats, and redundant data
  • Build clean and efficient data models using data wrangling techniques
  • Remove outliers and calculate standard deviation to gauge the spread of data
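The outlier and CTE items in the list above can be sketched together. The example below is an assumption-laden illustration, not the book's method: the `readings` table, the `find_outliers` helper, and the cutoff `k` are hypothetical. Because SQLite has no built-in standard deviation function, the query computes the population variance as E[x²] − (E[x])² inside a CTE and compares squared deviations against k² times that variance, which avoids needing a square root.

```python
import sqlite3


def find_outliers(values, k=2.0):
    """Return values lying more than k standard deviations from the mean.

    Hypothetical helper for illustration; None entries are stored as NULL
    and ignored by both the statistics and the filter.
    """
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE readings (id INTEGER PRIMARY KEY, amount REAL)")
    conn.executemany(
        "INSERT INTO readings (amount) VALUES (?)", [(v,) for v in values]
    )

    query = """
        WITH stats AS (
            SELECT AVG(amount) AS mean,
                   -- population variance: E[x^2] - (E[x])^2
                   AVG(amount * amount) - AVG(amount) * AVG(amount) AS variance
            FROM readings
            WHERE amount IS NOT NULL
        )
        SELECT r.amount
        FROM readings AS r
        CROSS JOIN stats AS s
        WHERE r.amount IS NOT NULL
          -- squared deviation > k^2 * variance  <=>  |x - mean| > k * stddev
          AND (r.amount - s.mean) * (r.amount - s.mean) > ? * ? * s.variance
        ORDER BY r.id
    """
    rows = conn.execute(query, (k, k)).fetchall()
    conn.close()
    return [row[0] for row in rows]


if __name__ == "__main__":
    print(find_outliers([10, 12, 11, 13, 12, 11, 100, None]))
```

A fixed multiple of the standard deviation is a simple rule that works best on roughly symmetric data; for heavily skewed data, a percentile-based or interquartile-range rule may be a better fit.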

Who this book is for

This book is for data analysts looking for effective hands-on methods to manage and analyze large volumes of data using SQL. It will also benefit data scientists, product managers, and anyone in a role that involves gathering data insights and developing business strategies with SQL. If you are new to SQL and databases, or have basic knowledge of them along with an understanding of data cleaning practices, this book will show you how to apply SQL concepts to build clean, standardized data models for accurate analysis.

Table of Contents

  1. Database Introduction
  2. Data Profiling and Preparation before Data Wrangling
  3. Data Wrangling on String Data Types
  4. Data Wrangling on the DATE Data Type
  5. Handling NULL values
  6. Pivoting Data Using SQL
  7. Subqueries and CTEs
  8. Aggregate Functions
  9. SQL Window Functions
  10. Optimizing Query Performance
  11. Descriptive Statistics with SQL
  12. Time Series with SQL
  13. Outlier Detection
