Advanced Analytics with Transact-SQL: Exploring Hidden Patterns and Rules in Your Data

Sarka, Dejan

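  • Publisher: Apress
  • Publication date: 2021-07-17
  • List price: $1,510
  • VIP price: $1,435 (5% off)
  • Language: English
  • Pages: 302
  • Binding: Quality Paper (also called trade paper)
  • ISBN: 1484271726
  • ISBN-13: 9781484271728
  • Related categories: SQL, MSSQL
  • Overseas special-order title (must be checked out separately)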

Product Description

Learn about business intelligence (BI) features in T-SQL and how they can help you with data science and analytics efforts without the need to bring in other languages such as R and Python. This book shows you how to compute statistical measures using your existing skills in T-SQL. You will learn how to calculate descriptive statistics, including centers, spreads, skewness, and kurtosis of distributions. You will also learn to find associations between pairs of variables, including calculating linear regression formulas and confidence levels with definite integration.
No analysis is good without data quality. Advanced Analytics with Transact-SQL introduces data quality issues and shows you how to check for completeness and accuracy, and measure improvements in data quality over time. The book also explains how to optimize queries involving temporal data, such as when you search for overlapping intervals. More advanced time-oriented information in the book includes hazard and survival analysis. Forecasting with exponential moving averages and autoregression is covered as well.
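As a rough illustration of the overlapping-interval search mentioned above, here is a minimal T-SQL sketch. It shows only the basic predicate (two closed intervals overlap when each one starts on or before the other ends), not the optimized approach the book describes. The table variable `@Contracts`, its columns, and the search bounds `@b` and `@e` are hypothetical names introduced for this example.

```sql
-- Hypothetical sample data; none of these names come from the book.
DECLARE @Contracts TABLE (
    ContractId INT  NOT NULL PRIMARY KEY,
    StartDate  DATE NOT NULL,
    EndDate    DATE NOT NULL
);

INSERT INTO @Contracts (ContractId, StartDate, EndDate)
VALUES (1, '2021-01-01', '2021-03-31'),
       (2, '2021-03-15', '2021-06-30'),
       (3, '2021-07-01', '2021-09-30');

-- The interval we are searching with (assumed closed on both ends).
DECLARE @b DATE = '2021-03-20',
        @e DATE = '2021-04-10';

-- A row overlaps [@b, @e] when it starts on or before @e
-- and ends on or after @b.
SELECT ContractId, StartDate, EndDate
FROM @Contracts
WHERE StartDate <= @e
  AND EndDate   >= @b
ORDER BY ContractId;
```

With this sample data the query returns contracts 1 and 2. Contract 3 starts after the search interval ends, so it is excluded. On a large table this simple predicate can be hard to index well, which is presumably why the book devotes space to optimizing it.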

Every web or retail shop wants to know which products customers tend to buy together. Predicting a discrete or continuous target variable from a few input variables is important for practically every type of business. This book helps you understand data science and the advanced algorithms used to analyze data, as well as terms such as data mining, machine learning, and text mining.


Key to many of the solutions in this book are T-SQL window functions. Author Dejan Sarka demonstrates efficient statistical queries that are based on window functions and optimized through algorithms built using mathematical knowledge and creativity. The formulas and usage of those statistical procedures are explained so you can understand and modify the techniques presented.
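To give a feel for the kind of window-function query described above, the following is a minimal sketch rather than an excerpt from the book. It computes a three-row moving average and a running total in a single pass over the data. The table variable `@Sales` and its columns are hypothetical.

```sql
-- Hypothetical sample data; none of these names come from the book.
DECLARE @Sales TABLE (
    SaleDate DATE           NOT NULL PRIMARY KEY,
    Amount   DECIMAL(10, 2) NOT NULL
);

INSERT INTO @Sales (SaleDate, Amount)
VALUES ('2021-01-01', 100.00),
       ('2021-01-02', 200.00),
       ('2021-01-03', 300.00),
       ('2021-01-04', 400.00);

SELECT SaleDate,
       Amount,
       -- Average of the current row and up to two preceding rows.
       AVG(Amount) OVER (ORDER BY SaleDate
                         ROWS BETWEEN 2 PRECEDING AND CURRENT ROW) AS MovingAvg3,
       -- Cumulative sum from the first row through the current row.
       SUM(Amount) OVER (ORDER BY SaleDate
                         ROWS UNBOUNDED PRECEDING) AS RunningTotal
FROM @Sales
ORDER BY SaleDate;
```

The explicit `ROWS` frame matters here: without it, an `ORDER BY` inside `OVER` defaults to a `RANGE` frame, which treats ties differently and cannot express a fixed-size moving window.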
T-SQL is supported in SQL Server, Azure SQL Database, and in Azure Synapse Analytics. There are so many BI features in T-SQL that it might become your primary analytic database language. If you want to learn how to get information from your data with the T-SQL language that you already are familiar with, then this is the book for you.

What You Will Learn

  • Describe distribution of variables with statistical measures
  • Find associations between pairs of variables
  • Evaluate the quality of the data you are analyzing
  • Perform time-series analysis on your data
  • Forecast values of a continuous variable
  • Perform market-basket analysis to predict customer purchasing patterns
  • Predict target variable outcomes from one or more input variables
  • Categorize passages of text by extracting and analyzing keywords

Who This Book Is For
Database developers and database administrators who want to translate their T-SQL skills into the world of business intelligence (BI) and data science. For readers who want to analyze large amounts of data efficiently by using their existing knowledge of T-SQL and Microsoft's various database platforms such as SQL Server and Azure SQL Database. Also for readers who want to improve their querying by learning new and original optimization techniques.

About the Author

Dejan Sarka, MCT and Data Platform MVP, is an independent trainer and consultant with more than 30 years of experience who focuses on development of database and business intelligence (BI) applications. He works on projects, and spends about half of his time on training and mentoring. He is the founder of the Slovenian SQL Server and .NET Users Group. Dejan Sarka is the main author or co-author of 19 books about databases and SQL Server, and has developed many courses and seminars for Microsoft, Radacad, SolidQ, and Pluralsight.
