Big Data Analytics with R (Paperback)
暫譯: 使用 R 進行大數據分析 (平裝本)

Simon Walkowiak

買這商品的人也買了...

商品描述

Key Features

  • Perform computational analyses on Big Data to generate meaningful results
  • Get a practical knowledge of R programming language while working on Big Data platforms like Hadoop, Spark, H2O and SQL/NoSQL databases,
  • Explore fast, streaming, and scalable data analysis with the most cutting-edge technologies in the market

Book Description

Big Data analytics is the process of examining large and complex data sets that often exceed the computational capabilities. R is a leading programming language of data science, consisting of powerful functions to tackle all problems related to Big Data processing.

The book will begin with a brief introduction to the Big Data world and its current industry standards. With introduction to the R language and presenting its development, structure, applications in real world, and its shortcomings. Book will progress towards revision of major R functions for data management and transformations. Readers will be introduce to Cloud based Big Data solutions (e.g. Amazon EC2 instances and Amazon RDS, Microsoft Azure and its HDInsight clusters) and also provide guidance on R connectivity with relational and non-relational databases such as MongoDB and HBase etc. It will further expand to include Big Data tools such as Apache Hadoop ecosystem, HDFS and MapReduce frameworks. Also other R compatible tools such as Apache Spark, its machine learning library Spark MLlib, as well as H2O.

What you will learn

  • Learn about current state of Big Data processing using R programming language and its powerful statistical capabilities
  • Deploy Big Data analytics platforms with selected Big Data tools supported by R in a cost-effective and time-saving manner
  • Apply the R language to real-world Big Data problems on a multi-node Hadoop cluster, e.g. electricity consumption across various socio-demographic indicators and bike share scheme usage
  • Explore the compatibility of R with Hadoop, Spark, SQL and NoSQL databases, and H2O platform

商品描述(中文翻譯)

**主要特點**
- 對大數據進行計算分析,以產生有意義的結果
- 在像 Hadoop、Spark、H2O 和 SQL/NoSQL 數據庫等大數據平台上,獲得 R 程式語言的實用知識
- 探索市場上最前沿技術的快速、流式和可擴展數據分析

**書籍描述**
大數據分析是檢查大型和複雜數據集的過程,這些數據集通常超出計算能力。R 是數據科學的主要程式語言,擁有強大的函數來解決與大數據處理相關的所有問題。

本書將以簡要介紹大數據世界及其當前行業標準開始。接著介紹 R 語言,並展示其發展、結構、在現實世界中的應用及其不足之處。本書將進一步修訂主要的 R 函數,用於數據管理和轉換。讀者將了解基於雲的大數據解決方案(例如 Amazon EC2 實例和 Amazon RDS、Microsoft Azure 及其 HDInsight 集群),並提供 R 與關聯和非關聯數據庫(如 MongoDB 和 HBase 等)的連接指導。內容還將擴展到包括大數據工具,如 Apache Hadoop 生態系統、HDFS 和 MapReduce 框架,以及其他 R 兼容工具,如 Apache Spark、其機器學習庫 Spark MLlib,以及 H2O。

**您將學到的內容**
- 了解使用 R 程式語言及其強大統計能力的大數據處理現狀
- 以具成本效益和節省時間的方式,部署支持 R 的選定大數據工具的大數據分析平台
- 將 R 語言應用於多節點 Hadoop 集群上的現實世界大數據問題,例如各種社會人口指標的電力消耗和自行車共享計劃的使用情況
- 探索 R 與 Hadoop、Spark、SQL 和 NoSQL 數據庫及 H2O 平台的兼容性