Programming Collective Intelligence: Building Smart Web 2.0 Applications (Paperback)

Toby Segaran



Description

Want to tap the power behind search rankings, product recommendations, social bookmarking, and online matchmaking? This fascinating book demonstrates how you can build Web 2.0 applications to mine the enormous amount of data created by people on the Internet. With the sophisticated algorithms in this book, you can write smart programs to access interesting datasets from other web sites, collect data from users of your own applications, and analyze and understand the data once you've found it.

Programming Collective Intelligence takes you into the world of machine learning and statistics, and explains how to draw conclusions about user experience, marketing, personal tastes, and human behavior in general -- all from information that you and others collect every day. Each algorithm is described clearly and concisely with code that can immediately be used on your web site, blog, Wiki, or specialized application. This book explains:

  • Collaborative filtering techniques that enable online retailers to recommend products or media
  • Methods of clustering to detect groups of similar items in a large dataset
  • Search engine features -- crawlers, indexers, query engines, and the PageRank algorithm
  • Optimization algorithms that search millions of possible solutions to a problem and choose the best one
  • Bayesian filtering, used in spam filters for classifying documents based on word types and other features
  • Using decision trees not only to make predictions, but to model the way decisions are made
  • Predicting numerical values rather than classifications to build price models
  • Support vector machines to match people in online dating sites
  • Non-negative matrix factorization to find the independent features in a dataset
  • Evolving intelligence for problem solving -- how a computer develops its skill by improving its own code the more it plays a game

Each chapter includes exercises for extending the algorithms to make them more powerful. Go beyond simple database-backed applications and put the wealth of Internet data to work for you.
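To give a flavor of the first topic in the list, here is a minimal sketch of user-based collaborative filtering in Python. It is an illustration only and is not taken from the book: the `ratings` data, the function names, and the choice of cosine similarity over co-rated items are all assumptions made for this example.

```python
from math import sqrt

# Hypothetical ratings data: user -> {item: rating}
ratings = {
    "ann": {"A": 5.0, "B": 3.0, "C": 4.0},
    "bob": {"A": 4.0, "B": 2.0, "C": 5.0, "D": 4.0},
    "cat": {"A": 1.0, "B": 5.0, "D": 2.0},
}


def cosine_similarity(a, b):
    """Cosine similarity between two users, over the items both have rated."""
    shared = set(a) & set(b)
    if not shared:
        return 0.0
    dot = sum(a[i] * b[i] for i in shared)
    norm_a = sqrt(sum(a[i] ** 2 for i in shared))
    norm_b = sqrt(sum(b[i] ** 2 for i in shared))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def recommend(user, ratings):
    """Rank items the user has not rated by similarity-weighted average rating."""
    totals, weights = {}, {}
    for other, other_ratings in ratings.items():
        if other == user:
            continue
        sim = cosine_similarity(ratings[user], other_ratings)
        if sim <= 0:
            continue
        for item, rating in other_ratings.items():
            if item in ratings[user]:
                continue  # only score items the user has not seen
            totals[item] = totals.get(item, 0.0) + sim * rating
            weights[item] = weights.get(item, 0.0) + sim
    scores = {item: totals[item] / weights[item] for item in totals}
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)


if __name__ == "__main__":
    print(recommend("ann", ratings))
```

In this toy data, "ann" rates items much like "bob" does, so bob's rating of the unseen item "D" counts for more than cat's in the predicted score.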

 
