Mastering ElasticSearch
暫譯: 精通 ElasticSearch

Rafal Kuc, Marek Rogozinski

相關主題

商品描述

Written for intermediate users, this tutorial helps you utilize the power of Apache Lucene and Elastic Search to optimize your information retrieval. From design to implementation to management, it's the all-inclusive guide.

Overview

  • Learn about Apache Lucene and ElasticSearch design and architecture to fully understand how this great search engine works
  • Design, configure, and distribute your index, coupled with a deep understanding of the workings behind it
  • Learn about the advanced features in an easy to read book with detailed examples that will help you understand and use the sophisticated features of ElasticSearch

In Detail

ElasticSearch is fast, distributed, scalable, and written in the Java search engine that leverages Apache Lucene capabilities providing a new level of control over how you index and search even the largest set of data.

"Mastering ElasticSearch" covers the intermediate and advanced functionalities of ElasticSearch and will let you understand not only how ElasticSearch works, but will also guide you through its internals such as caches, Apache Lucene library, monitoring capabilities, and the Java API. In addition to that you'll see the practical usage of ElasticSearch configuration parameters, monitoring API, and easy-to-use and extend examples on how to extend ElasticSearch by writing your own plugins.

"Mastering ElasticSearch" starts by showing you how Apache Lucene works and what the ElasticSearch architecture looks like. It covers advanced querying capabilities, index configuration control, index distribution, ElasticSearch administration and troubleshooting. Finally you'll see how to improve the user’s search experience, use the provided Java API and develop your own custom plugins.

It will help you learn how Apache Lucene works both in terms of querying and indexing. You'll also learn how to use different scoring models, rescoring documents using other queries, alter how the index is written by using custom postings and what segments merging is, and how to configure it to your needs. You'll optimize your queries by modifying them to use filters and you'll see why it is important. The book describes in details how to use the shard allocation mechanism present in ElasticSearch such as forced awareness.

If you are looking for a book that will allow you to easily extend your basic knowledge about ElasticSearch or you want to go deeper into the world of full text search using ElasticSearch then this book is for you.

What you will learn from this book

  • Understand how Apache Lucene works
  • Use and configure different scoring models to alter default scoring mechanism
  • Exploit query rescore to recalculate the score of top N documents
  • Choose the right amount of shards and replicas for your deployment
  • Use shards allocation wisely and understand its internals
  • Alter the index format by using different postings format
  • Use your knowledge to create scalable, efficient, and fault tolerant clusters
  • Monitor your cluster by using and understanding the ElasticSearch API
  • Learn to control segments merging and why ElasticSearch uses merging at all
  • Overcome problems with garbage collection, threading, and I/O
  • Improve the user search experience by using ElasticSearch functionality
  • Develop an application using the ElasticSearch Java API and develop custom ElasticSearch plugins

Approach

A practical tutorial that covers the difficult design, implementation, and management of search solutions.

商品描述(中文翻譯)

撰寫給中級使用者的這本教程幫助您利用 Apache Lucene 和 ElasticSearch 的強大功能來優化您的資訊檢索。從設計到實作再到管理,這是一本全方位的指南。

概述
- 了解 Apache Lucene 和 ElasticSearch 的設計與架構,以充分理解這個優秀搜尋引擎的運作方式
- 設計、配置和分配您的索引,並深入了解其背後的運作原理
- 以易於閱讀的書籍學習進階功能,並透過詳細範例幫助您理解和使用 ElasticSearch 的複雜功能

詳細內容
ElasticSearch 是一個快速、分散式、可擴展的搜尋引擎,基於 Java 編寫,利用 Apache Lucene 的能力,提供對如何索引和搜尋即使是最大數據集的新控制層級。

《精通 ElasticSearch》涵蓋了 ElasticSearch 的中級和進階功能,讓您不僅了解 ElasticSearch 的運作方式,還能引導您深入其內部結構,如快取、Apache Lucene 函式庫、監控能力和 Java API。此外,您將看到 ElasticSearch 配置參數的實際使用、監控 API,以及如何通過編寫自己的插件來擴展 ElasticSearch 的易用和可擴展範例。

《精通 ElasticSearch》首先展示了 Apache Lucene 的運作方式以及 ElasticSearch 的架構。它涵蓋了進階查詢能力、索引配置控制、索引分配、ElasticSearch 管理和故障排除。最後,您將看到如何改善用戶的搜尋體驗,使用提供的 Java API 並開發自己的自訂插件。

這本書將幫助您了解 Apache Lucene 在查詢和索引方面的運作。您還將學習如何使用不同的計分模型,使用其他查詢重新計分文件,通過使用自訂的發佈格式來改變索引的寫入方式,以及什麼是段合併,並如何根據您的需求進行配置。您將通過修改查詢以使用過濾器來優化查詢,並了解這為什麼重要。本書詳細描述了如何使用 ElasticSearch 中存在的分片分配機制,例如強制感知。

如果您正在尋找一本能夠輕鬆擴展您對 ElasticSearch 基礎知識的書籍,或者您想更深入了解使用 ElasticSearch 的全文檢索世界,那麼這本書就是為您而寫的。

您將從這本書中學到的內容
- 了解 Apache Lucene 的運作方式
- 使用和配置不同的計分模型以改變預設計分機制
- 利用查詢重新計分來重新計算前 N 個文件的分數
- 為您的部署選擇合適的分片和副本數量
- 明智地使用分片分配並了解其內部運作
- 通過使用不同的發佈格式來改變索引格式
- 利用您的知識創建可擴展、高效且容錯的叢集
- 通過使用和理解 ElasticSearch API 來監控您的叢集
- 學習控制段合併以及為什麼 ElasticSearch 會使用合併
- 克服垃圾回收、執行緒和 I/O 的問題
- 通過使用 ElasticSearch 功能改善用戶搜尋體驗
- 使用 ElasticSearch Java API 開發應用程式並開發自訂的 ElasticSearch 插件

方法
這是一個實用的教程,涵蓋了搜尋解決方案的困難設計、實作和管理。