Symbol Spotting in Digital Libraries: Focused Retrieval over Graphic-rich Document Collections
暫譯: 數位圖書館中的符號識別:針對圖形豐富文件集合的精確檢索

Marçal Rusiñol

  • 出版商: Springer
  • 出版日期: 2014-12-03
  • 售價: $4,600
  • 貴賓價: 9.5$4,370
  • 語言: 英文
  • 頁數: 196
  • 裝訂: Paperback
  • ISBN: 1447161793
  • ISBN-13: 9781447161790
  • 海外代購書籍(需單獨結帳)

商品描述

Pattern recognition basically deals with the recognition of patterns, shapes, objects, things in images. Document image analysis was one of the very ?rst applications of pattern recognition and even of computing. But until the 1980s, research in this ?eld was mainly dealing with text-based documents, including OCR (Optical Character Recognition) and page layout analysis. Only a few people were looking at more speci?c documents such as music sheet, bank cheques or forms. The community of graphics recognition became visible in the late 1980s. Their speci?c interest was to recognize high-level objects represented by line drawings and graphics. The speci?c pattern recognition problems they had to deal with was raster-to-graphics conversion (i.e., recognizing graphical primitives in a cluttered pixel image), text-graphics separation, and symbol recognition. The speci?c problem of symbol recognition in graphical documents has received a lot of attention. The symbols to be recognized can be musical notation, electrical symbols, architectural objects, pictograms in maps, etc. At ?rst glance, the symbol recognition problems seems to be very similar to that of character recognition; - ter all, characters are basically a subset of symbols. Therefore, the large know-how in OCR has been extensively used in graphical symbol recognition: starting with segmenting the document to extract the symbols, extracting features from the s- bols, and then recognizing them through classi?cation or matching, with respect to a training/learning set.

商品描述(中文翻譯)

模式識別基本上處理的是圖像中模式、形狀、物體和事物的識別。文件圖像分析是模式識別甚至計算的最早應用之一。但直到1980年代,這個領域的研究主要集中在基於文本的文件上,包括光學字符識別(OCR)和頁面佈局分析。只有少數人關注更具體的文件,例如樂譜、銀行支票或表單。圖形識別社群在1980年代末變得顯而易見。他們的具體興趣是識別由線條繪製和圖形表示的高級物體。他們必須處理的具體模式識別問題包括光柵到圖形的轉換(即在雜亂的像素圖像中識別圖形原件)、文本與圖形的分離,以及符號識別。在圖形文件中的符號識別問題受到了很多關注。需要識別的符號可以是音樂符號、電氣符號、建築物件、地圖中的圖示等。乍一看,符號識別問題似乎與字符識別非常相似;畢竟,字符基本上是符號的一個子集。因此,OCR中的大量專業知識被廣泛應用於圖形符號識別:從分割文件以提取符號、從符號中提取特徵,然後通過分類或匹配來識別它們,這些都是基於訓練/學習集進行的。