Big Data Analytics with Java
暫譯: 使用 Java 進行大數據分析

Rajat Mehta

買這商品的人也買了...

商品描述

Key Features

  • Acquire real-world set of tools for building enterprise level data science applications
  • Surpasses the barrier of other languages in data science and learn create useful object-oriented codes
  • Extensive use of Java compliant big data tools like apache spark, Hadoop, etc.

Book Description

This book covers case studies such as sentiment analysis on a tweet dataset, recommendations on a movielens dataset, customer segmentation on an ecommerce dataset, and graph analysis on actual flights dataset.

This book is an end-to-end guide to implement analytics on big data with Java. Java is the de facto language for major big data environments, including Hadoop. This book will teach you how to perform analytics on big data with production-friendly Java. This book basically divided into two sections. The first part is an introduction that will help the readers get acquainted with big data environments, whereas the second part will contain a hardcore discussion on all the concepts in analytics on big data. It will take you from data analysis and data visualization to the core concepts and advantages of machine learning, real-life usage of regression and classification using Naive Bayes, a deep discussion on the concepts of clustering,and a review of simple neural networks on big data using deepLearning4j or plain Java Spark code. This book is a must-have book for Java developers who want to start learning big data analytics and want to use it in the real world.

What you will learn

  • Start from simple analytic tasks on big data
  • Get into more complex tasks with predictive analytics on big data using machine learning
  • Learn real time analytic tasks
  • Understand the concepts with examples and case studies
  • Prepare and refine data for analysis
  • Create charts in order to understand the data
  • See various real-world datasets

About the Author

The author is a VP (Technical Architect) in technology in JP Morgan Chase in New York. The author is a sun certified java developer and has worked on java related technologies for more than 16 years. Current role for the past few years heavily involves the usage of bid data stack and running analytics on it. Author is also a contributor in various open source projects that are available on his GitHub repository and is also a frequent write on dev magazines.

Table of Contents

  1. Big Data Analytics with Java
  2. First Steps on Data Analysis
  3. Data Visualization
  4. Basics of Machine Learning
  5. Regression on Big Data
  6. Naive Bayes and Sentiment Analysis
  7. Classification using Decision Trees
  8. Classification on ensemble of Decision Trees
  9. Recommendations on Big Data
  10. Clustering in Action on Big Data
  11. Building graphs on Big Data
  12. Streaming on Big Data
  13. Deep Learning Using Big Data

商品描述(中文翻譯)

關鍵特點
- 獲得構建企業級數據科學應用的實用工具集
- 超越其他語言在數據科學中的障礙,學會創建有用的面向對象代碼
- 廣泛使用符合 Java 的大數據工具,如 Apache Spark、Hadoop 等

書籍描述
本書涵蓋了案例研究,例如對推文數據集的情感分析、對 movielens 數據集的推薦、對電子商務數據集的客戶細分,以及對實際航班數據集的圖形分析。
本書是使用 Java 在大數據上實施分析的端到端指南。Java 是主要大數據環境(包括 Hadoop)的事實標準語言。本書將教您如何使用適合生產環境的 Java 在大數據上執行分析。本書基本上分為兩個部分。第一部分是介紹,幫助讀者熟悉大數據環境,而第二部分將包含對大數據分析中所有概念的深入討論。它將帶您從數據分析和數據可視化到機器學習的核心概念和優勢,使用 Naive Bayes 進行回歸和分類的實際應用,對聚類概念的深入討論,以及使用 deepLearning4j 或純 Java Spark 代碼在大數據上回顧簡單神經網絡。本書是希望開始學習大數據分析並希望在現實世界中使用的 Java 開發人員必備的書籍。

您將學到什麼
- 從簡單的數據分析任務開始
- 使用機器學習在大數據上進行更複雜的預測分析任務
- 學習實時分析任務
- 通過示例和案例研究理解概念
- 準備和精煉數據以進行分析
- 創建圖表以理解數據
- 查看各種現實世界數據集

關於作者
作者是摩根大通(JP Morgan Chase)在紐約的技術副總裁(技術架構師)。作者是 Sun 認證的 Java 開發人員,並在 Java 相關技術上工作了超過 16 年。過去幾年的當前角色主要涉及使用大數據堆棧並在其上運行分析。作者還是其 GitHub 存儲庫中各種開源項目的貢獻者,並且經常在開發雜誌上撰寫文章。

目錄
1. 使用 Java 進行大數據分析
2. 數據分析的第一步
3. 數據可視化
4. 機器學習基礎
5. 大數據上的回歸
6. Naive Bayes 和情感分析
7. 使用決策樹進行分類
8. 決策樹集成的分類
9. 大數據上的推薦
10. 大數據中的聚類實踐
11. 在大數據上構建圖形
12. 大數據流處理
13. 使用大數據的深度學習

最後瀏覽商品 (20)