Scalability Challenges in Web Search Engines (Synthesis Lectures on Information Concepts, Retrieval, and Services)
暫譯: 網路搜尋引擎的可擴展性挑戰(資訊概念、檢索與服務綜合講座)

B. Barla Cambazoglu, Ricardo Baeza-Yates

  • 出版商: Morgan & Claypool
  • 出版日期: 2015-12-01
  • 售價: $1,780
  • 貴賓價: 9.5$1,691
  • 語言: 英文
  • 頁數: 140
  • 裝訂: Paperback
  • ISBN: 1627058125
  • ISBN-13: 9781627058124
  • 相關分類: JVM 語言
  • 海外代購書籍(需單獨結帳)

相關主題

商品描述

In this book, we aim to provide a fairly comprehensive overview of the scalability and efficiency challenges in large-scale web search engines. More specifically, we cover the issues involved in the design of three separate systems that are commonly available in every web-scale search engine: web crawling, indexing, and query processing systems. We present the performance challenges encountered in these systems and review a wide range of design alternatives employed as solution to these challenges, specifically focusing on algorithmic and architectural optimizations. We discuss the available optimizations at different computational granularities, ranging from a single computer node to a collection of data centers. We provide some hints to both the practitioners and theoreticians involved in the field about the way large-scale web search engines operate and the adopted design choices. Moreover, we survey the efficiency literature, providing pointers to a large number of relatively important research papers. Finally, we discuss some open research problems in the context of search engine efficiency.

商品描述(中文翻譯)

在本書中,我們旨在提供一個相當全面的概述,探討大型網路搜尋引擎中的可擴展性和效率挑戰。更具體地說,我們涵蓋了設計三個在每個網路規模搜尋引擎中常見的獨立系統所涉及的問題:網路爬蟲、索引和查詢處理系統。我們呈現了這些系統中遇到的性能挑戰,並回顧了作為解決這些挑戰所採用的各種設計替代方案,特別專注於算法和架構的優化。我們討論了在不同計算粒度下可用的優化,範圍從單一計算機節點到數據中心的集合。我們為從事該領域的實務工作者和理論家提供了一些提示,關於大型網路搜尋引擎的運作方式及其採用的設計選擇。此外,我們調查了效率文獻,提供了大量相對重要的研究論文的指引。最後,我們在搜尋引擎效率的背景下討論了一些未解的研究問題。