R Data Mining
暫譯: R 數據挖掘

Andrea Cirillo

  • 出版商: Packt Publishing
  • 出版日期: 2017-11-28
  • 定價: $1,800
  • 售價: 6.0$1,080
  • 語言: 英文
  • 頁數: 442
  • 裝訂: Paperback
  • ISBN: 1787124460
  • ISBN-13: 9781787124462
  • 相關分類: R 語言Data-mining
  • 相關翻譯: R數據挖掘實戰 (簡中版)
  • 立即出貨 (庫存=1)

商品描述

Key Features

  • Understand the basics of data mining and why R is a perfect tool for it.
  • Manipulate your data using the popular R packages and gather valuable business insights from it.
  • Written in a clear, easy to understand manner, and includes lots of practical examples involving real-world datasets

Book Description

R is widely used in leveraging data mining techniques across many different industries, including finance, medicine, scientific research and more. This book will empower you to produce and show impressive analyses from the data, selecting and implementing the appropriate data mining techniques in R.

The book begins with a detailed introduction to data mining and why R is a popular alternative for it. You will get a comprehensive coverage of the various R packages which you can use in the data mining process. We will then proceed to use these packages for manipulating various datasets, through practical examples including real-world datasets. Implement algorithms like k-means, SVM, and more, and techniques like classification and cluster analysis to extract insightful patterns and associations. Topics like outlier detection, regression analysis, anomaly detection and network analysis are also covered, in a very easy to understand manner. You will also use the popular ggplot2 package to visualize the insights you get from the analysis, and aid your decision-making.

By the end of this book, you will have grasped the fundamentals of data mining, and the various techniques you can deploy with the popular R packages to get the most out of your data.

What you will learn

  • Get introduced to most relevant packages for data mining within the R environment.
  • Get confident about data quality and structure through data validation and exploratory data analysis
  • Learn relevant steps to validate all performed analysis
  • Develop a regression model from your real gmail data
  • Produce clear and effective reports to show analyses results
  • Get insights from your analyses using meaningful visualizations with ggplot2

商品描述(中文翻譯)

關鍵特點
- 了解資料探勘的基本概念,以及為什麼 R 是一個完美的工具。
- 使用流行的 R 套件來操作您的資料,並從中獲取有價值的商業洞察。
- 以清晰易懂的方式撰寫,並包含許多涉及真實世界資料集的實用範例。

書籍描述
R 在許多不同產業中廣泛用於利用資料探勘技術,包括金融、醫療、科學研究等。本書將使您能夠從資料中產生並展示令人印象深刻的分析,選擇並實施適當的資料探勘技術於 R 中。

本書首先詳細介紹資料探勘及為什麼 R 是一個受歡迎的替代方案。您將全面了解在資料探勘過程中可以使用的各種 R 套件。接著,我們將使用這些套件來操作各種資料集,透過包括真實世界資料集的實用範例來進行。實現像 k-means、SVM 等演算法,以及分類和聚類分析等技術,以提取有洞察力的模式和關聯。還將涵蓋異常值檢測、回歸分析、異常檢測和網路分析等主題,以非常易於理解的方式進行。您還將使用流行的 ggplot2 套件來視覺化您從分析中獲得的洞察,並協助您的決策。

在本書結束時,您將掌握資料探勘的基本原則,以及可以使用流行的 R 套件來充分利用您的資料的各種技術。

您將學到的內容
- 了解 R 環境中最相關的資料探勘套件。
- 透過資料驗證和探索性資料分析,對資料質量和結構充滿信心。
- 學習驗證所有執行分析的相關步驟。
- 從您的真實 Gmail 資料中開發回歸模型。
- 產生清晰有效的報告以顯示分析結果。
- 使用 ggplot2 進行有意義的視覺化,從您的分析中獲取洞察。

最後瀏覽商品 (20)