Exploring Data with RapidMiner

Andrew Chisholm

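  • Publisher: Packt Publishing
  • Publication date: 2013-11-17
  • List price: $1,690
  • Member price: $1,606 (5% off)
  • Language: English
  • Pages: 162
  • Binding: Paperback
  • ISBN: 1782169334
  • ISBN-13: 9781782169338
  • Overseas special-order title (requires separate checkout)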

Product Description

RapidMiner is a highly versatile tool that can make data work harder for you. This book will show you how to import, parse, and structure your data with remarkable speed and efficiency. It's data mining made accessible.

Overview

  • See how to import, parse, and structure your data quickly and effectively
  • Understand the visualization possibilities and be inspired to use these with your own data
  • Structured in a modular way to adhere to standard industry processes

In Detail

Data is everywhere, and its volume is growing so fast that the gap between what is available and what people can understand keeps widening. There is huge value in data, but much of it lies untapped. 80% of data mining is about understanding data, exploring it, cleaning it, and structuring it so that it can be mined. RapidMiner is an environment for machine learning, data mining, text mining, predictive analytics, and business analytics. It is used for research, education, training, rapid prototyping, application development, and industrial applications.

Exploring Data with RapidMiner is packed with practical examples to help practitioners get to grips with their own data. The chapters within this book are arranged within an overall framework and can additionally be consulted on an ad-hoc basis. It provides simple to intermediate examples showing modeling, visualization, and more using RapidMiner.

Exploring Data with RapidMiner is a helpful guide that presents the important steps in a logical order. This book starts with importing data and then leads you through cleaning, handling missing values, visualizing, and extracting additional information, as well as understanding the time constraints that real data places on getting a result. The book uses real examples to help you understand how to set up processes quickly.

This book will give you a solid understanding of the possibilities that RapidMiner offers for exploring data, and you will be inspired to use it for your own work.

What you will learn from this book

  • Import real data from files in multiple formats and from databases
  • Extract features from structured and unstructured data
  • Restructure, reduce, and summarize data to help you understand it more easily and process it more quickly
  • Visualize data in new ways to help you understand it
  • Detect outliers and apply methods to handle them
  • Detect missing data and implement ways to handle it
  • Understand resource constraints and what to do about them

Approach

A step-by-step tutorial style using examples so that users of different levels will benefit from the facilities offered by RapidMiner.

Who this book is written for

If you are a computer scientist or an engineer who has real data from which you want to extract value, this book is ideal for you. You will need to have at least a basic awareness of data mining techniques and some exposure to RapidMiner.
