Managing Gigabytes: Compressing and Indexing Documents and Images, 2/e (Hardcover)
暫譯: 管理千兆位元:文件與影像的壓縮與索引,第二版 (精裝版)
Ian H. Witten, Alistair Moffat, Timothy C. Bell
- 出版商: Morgan Kaufmann
- 出版日期: 1999-05-03
- 售價: $1,078
- 語言: 英文
- 頁數: 550
- 裝訂: Hardcover
- ISBN: 1558605703
- ISBN-13: 9781558605701
-
相關分類:
大數據 Big-data、資料庫、Data Science
下單後立即進貨 (約5~7天)
買這商品的人也買了...
-
$520$343 -
$1,250$1,225 -
$1,029Fundamentals of Data Structures in C
-
$640$608 -
$2,640$2,508 -
$1,570$1,492 -
$1,029Operating Systems: Internals and Design Principles, 4/e
-
$970Introduction to Algorithms, 2/e
-
$1,260Beginning Perl for Bioinformatics (Paperback)
-
$960$912 -
$700Microsoft Visual Basic .NET Step by Step
-
$1,920$1,824 -
$920$727 -
$2,370$2,252 -
$690$538 -
$720$569 -
$750$638 -
$560$476 -
$2,370$2,252 -
$650$514 -
$399CCNP Self-Study: Building Cisco Remote Access Networks (BCRAN), 2/e (Hardcover)
-
$2,370$2,252 -
$399CCNP Self-Study : Building Scalable Cisco Internetworks (BSCI), 2/e
-
$350$277 -
$650$507
相關主題
商品描述
Order This Book | Authors | Contents | Web-Enhanced | Related Titles
"This book is the Bible for anyone who needs to manage large data collections. It's required reading for our search gurus at Infoseek. The authors have done an outstanding job of incorporating and describing the most significant new research in information retrieval over the past five years into this second edition."
Steve Kirsch, Cofounder, Infoseek Corporation
"The new edition of Witten, Moffat, and Bell not only has newer and better text search algorithms but much material on image analysis and joint image/text processing. If you care about search engines, you need this book: it is the only one with full details of how they work. The book is both detailed and enjoyable; the authors have combined elegant writing with top-grade programming."
Michael Lesk, National Science Foundation
"The coverage of compression, file organizations, and indexing techniques for full text and document management systems is unsurpassed. Students, researchers, and practitioners will all benefit from reading this book."
Bruce Croft, Director, Center for Intelligent Information Retrieval at the University of Massachusetts
In this fully updated second edition of the highly acclaimed Managing Gigabytes, authors Witten, Moffat, and Bell continue to provide unparalleled coverage of state-of-the-art techniques for compressing and indexing data. Whatever your field, if you work with large quantities of information, this book is essential reading--an authoritative theoretical resource and a practical guide to meeting the toughest storage and access challenges. It covers the latest developments in compression and indexing and their application on the Web and in digital libraries. It also details dozens of powerful techniques supported by mg, the authors' own system for compressing, storing, and retrieving text, images, and textual images. mg's source code is freely available on the Web.
The authors (Ian H. Witten, Alistair Moffat, and Timothy C. Bell) all hold senior faculty positions at leading southern hemisphere universities and have undertaken innovative research in the areas addressed in this book. Collectively, they have authored eight books and over 300 research papers. They also serve on the program committees of many international conferences, including the IEEE Data Compression Conference and the ACM Digital Libraries and Information Retrieval conferences.
PREFACE
1. OVERVIEW
2. TEXT COMPRESSION
3. INDEXING
4. QUERYING
5. INDEX CONSTRUCTION
6. IMAGE COMPRESSION
7. TEXTUAL IMAGES
8. MIXED TEXT AND IMAGES
9. IMPLEMENTATION
10. THE INFORMATION EXPLOSION
A. GUIDE TO THE MG SYSTEM
B. GUIDE TO THE NZDL
REFERENCES
INDEX
The authors' website for the book.
Multimedia Information & Systems
Database
Computer & Communication Networks
商品描述(中文翻譯)
「這本書是任何需要管理大型數據集合的人的聖經。對於我們在 Infoseek 的搜尋專家來說,這是必讀書籍。作者在這第二版中出色地整合並描述了過去五年來信息檢索領域最重要的新研究。」
Steve Kirsch, Infoseek Corporation 共同創辦人
「Witten、Moffat 和 Bell 的新版本不僅擁有更新和更好的文本搜尋演算法,還包含大量有關圖像分析和圖像/文本聯合處理的材料。如果你關心搜尋引擎,你需要這本書:它是唯一詳細說明它們如何運作的書籍。這本書既詳細又有趣;作者將優雅的寫作與一流的程式設計相結合。」
Michael Lesk, 國家科學基金會
「對於全文和文檔管理系統的壓縮、文件組織和索引技術的涵蓋無與倫比。學生、研究人員和從業者都將從閱讀這本書中受益。」
Bruce Croft, 麻省理工學院智能信息檢索中心主任
在這本備受讚譽的Managing Gigabytes的全面更新第二版中,作者 Witten、Moffat 和 Bell 繼續提供無與倫比的最先進技術的涵蓋,專注於數據的壓縮和索引。無論你的領域是什麼,如果你處理大量信息,這本書都是必讀之作——它是權威的理論資源,也是應對最艱難的存儲和訪問挑戰的實用指南。它涵蓋了壓縮和索引的最新發展及其在網路和數字圖書館中的應用。它還詳細介紹了數十種強大的技術,這些技術由 mg 支持,mg 是作者自己用於壓縮、存儲和檢索文本、圖像和文本圖像的系統。mg 的源代碼可在網路上免費獲得。
作者(Ian H. Witten、Alistair Moffat 和 Timothy C. Bell)均在南半球的頂尖大學擔任高級教職,並在本書所涉及的領域進行了創新研究。他們共同著作了八本書和超過 300 篇研究論文。他們還在許多國際會議的程序委員會中任職,包括 IEEE 數據壓縮會議和 ACM 數字圖書館與信息檢索會議。
前言
1. 概述
2. 文本壓縮
3. 索引
4. 查詢
5. 索引建構
6. 圖像壓縮
7. 文本圖像
8. 混合文本和圖像
9. 實作
10. 信息爆炸
A. MG 系統指南
B. NZDL 指南
參考文獻
索引