HADOOP. Integration in IBM, Microsoft and SAS

James Braselton

  • 出版商: CreateSpace Independ
  • 出版日期: 2014-08-06
  • 售價: $1,160
  • 貴賓價: 9.5$1,102
  • 語言: 英文
  • 頁數: 148
  • 裝訂: Paperback
  • ISBN: 1500755656
  • ISBN-13: 9781500755652
  • 相關分類: Hadoop
  • 無法訂購

相關主題

商品描述

There are mountains of untapped potential in our information. Until now, it’s been too cost prohibitive to analyze these massive volumes. Of course, there’s also been a staggering opportunity cost associated with not tapping into this information, as the potential of this yet-to-be-analyzed information is near-limitless. And we’re not just talking the ubiquitous “competitive differentiation” marketing slogan here; we’re talking innovation, discovery, association, and pretty much any- thing else that could make the way you work tomorrow very different, with even more tangible results and insight, from the way you work today. People and organizations have attempted to tackle this problem from many different angles. Of course, the angle that is currently leading the pack in terms of popularity for massive data analysis is an open source project called Hadoop. Hadoop is shipped as part of IBM tools, SQL Server 2014 and SAS applications. Hadoop is shipped as part of the IBM InfoSphere BigInsights (BigInsights) platform. Quite simply, BigInsights embraces, hardens, and extends the Hadoop open source framework with enterprise-grade security, governance, availability, integration into existing data stores, tooling that simplifies and improves developer productivity, scalability, analytic toolkits, and more. BigInsights is (and will always be) based on the nonforked core Hadoop distribution, and backwards compatibility. Hadoop is also shipped as part of SAS tools. SAS incorporated Hadoop into their applications (SAS Base, SAS Data Integration, Sas Enterpris Guide and SAS Enterprise Miner). Same SAS aplications works in-memory on Hadoop (In-memory Statistics, SAS Visual Analytics and SAS Visual Statistics). Finaly, Hadoop is also shipped as part of Microsoft SQL Server 2014 and HDInsight. SQL Server 2014 works in-memory across Hadoop. HDInsight is a Hadoop-based platform that you can use to process data of all kinds in the cloud. In particular, HDInsight is useful for processing high volumes of structured and unstructured data, which traditional relational database systems typically cannot support for a variety of reasons. HDInsight allows you to quickly establish an infrastructure for big data analysis, whether you want to develop a proof of concept for a big data solution or support ongoing analytical requirements in a production environment. Furthermore, HDInsight integrates with Microsoft’s business-intelligence tools to enable users to enhance big data with additional sources and then explore and analyze the results to gain deeper insights. Most functionality within HDInsight and other Hadoop distributions is similar. Consequently, any current experience with Hadoop is largely transferable. Keep in mind that interaction with HDInsight requires you to use Windows Azure PowerShell commands, so a basic knowledge of PowerShell is required to work with the cluster.

商品描述(中文翻譯)

我們的信息中潛藏著巨大的未開發潛力。直到現在,分析這些龐大的數據量的成本一直過於高昂。當然,不利用這些信息也伴隨著驚人的機會成本,因為這些尚未分析的信息的潛力幾乎是無限的。我們不僅僅是在談論那句無處不在的「競爭差異化」的行銷口號;我們在談論創新、發現、關聯,以及幾乎任何其他能使你明天的工作方式與今天截然不同,並帶來更具體的結果和洞察的事物。人們和組織已經從許多不同的角度嘗試解決這個問題。當然,目前在大數據分析方面最受歡迎的角度是一個名為Hadoop的開源專案。Hadoop作為IBM工具、SQL Server 2014和SAS應用程式的一部分進行發佈。Hadoop也是IBM InfoSphere BigInsights平台的一部分。簡單來說,BigInsights擁抱、強化並擴展了Hadoop開源框架,提供企業級的安全性、治理、可用性、與現有數據存儲的整合、簡化並提升開發者生產力的工具、可擴展性、分析工具包等。BigInsights基於未分叉的核心Hadoop發行版,並保持向後相容性。Hadoop也作為SAS工具的一部分進行發佈。SAS將Hadoop整合到他們的應用程式中(SAS Base、SAS Data Integration、SAS Enterprise Guide和SAS Enterprise Miner)。同樣的SAS應用程式在Hadoop上以內存方式運行(內存統計、SAS Visual Analytics和SAS Visual Statistics)。最後,Hadoop也作為Microsoft SQL Server 2014和HDInsight的一部分進行發佈。SQL Server 2014在Hadoop上以內存方式運行。HDInsight是一個基於Hadoop的平台,您可以用來在雲端處理各種數據。特別是,HDInsight對於處理大量結構化和非結構化數據非常有用,因為傳統的關聯數據庫系統通常因各種原因無法支持這些數據。HDInsight允許您快速建立大數據分析的基礎設施,無論您是想為大數據解決方案開發概念驗證,還是支持生產環境中的持續分析需求。此外,HDInsight與Microsoft的商業智能工具整合,使得用戶能夠用額外的數據源增強大數據,然後探索和分析結果以獲得更深入的洞察。HDInsight和其他Hadoop發行版中的大多數功能是相似的。因此,任何目前對Hadoop的經驗在很大程度上都是可轉移的。請注意,與HDInsight的互動需要使用Windows Azure PowerShell命令,因此需要具備基本的PowerShell知識才能與集群進行操作。