AUDIO-VISUAL PERSON TRACKING: A PRACTICAL APPROACH
暫譯: 音頻視覺人員追蹤:實用方法
Fotios Talantzis, Aristodemos Pnevmatikakis, Anthony G Constantinides
- 出版商: World Scientific Pub
- 出版日期: 2011-11-30
- 售價: $3,530
- 貴賓價: 9.5 折 $3,354
- 語言: 英文
- 頁數: 209
- 裝訂: Hardcover
- ISBN: 1848165811
- ISBN-13: 9781848165816
海外代購書籍(需單獨結帳)
相關主題
商品描述
This book deals with the creation of the algorithmic backbone that enables a computer to perceive humans in a monitored space. This is performed using the same signals that humans process, i.e., audio and video. Computers reproduce the same type of perception using sensors and algorithms in order to detect and track multiple interacting humans, by way of multiple cues, like bodies, faces or speech. This application domain is challenging, because audio and visual signals are cluttered by both background and foreground objects. First, particle filtering is established as the framework for tracking. Then, audio, visual and also audio-visual tracking systems are separately explained. Each modality is analyzed, starting with sensor configuration, detection for tracker initialization and the trackers themselves. Techniques to fuse the modalities are then considered. Instead of offering a monolithic approach to the tracking problem, this book also focuses on implementation by providing MATLAB code for every presented component. This way, the reader can connect every concept with corresponding code. Finally, the applications of the various tracking systems in different domains are studied.
商品描述(中文翻譯)
本書探討了創建算法骨幹,使計算機能夠在監控空間中感知人類的過程。這是通過使用人類處理的相同信號來實現的,即音頻和視頻。計算機利用傳感器和算法重現相同類型的感知,以檢測和追蹤多個互動的人類,通過多種線索,如身體、面孔或語音。這一應用領域具有挑戰性,因為音頻和視覺信號受到背景和前景物體的干擾。首先,粒子濾波被建立為追蹤的框架。然後,音頻、視覺以及音視頻追蹤系統分別進行解釋。每種模態都進行分析,從傳感器配置、追蹤器初始化的檢測到追蹤器本身。接著考慮融合這些模態的技術。本書不僅提供對追蹤問題的整體解決方案,還專注於實現,為每個呈現的組件提供 MATLAB 代碼。這樣,讀者可以將每個概念與相應的代碼相連接。最後,研究了各種追蹤系統在不同領域的應用。