Vision-Based Interaction (Synthesis Lectures on Computer Vision)
暫譯: 基於視覺的互動（計算機視覺綜合講座）

Name: Vision-Based Interaction (Synthesis Lectures on Computer Vision)
Price: 1815 TWD
Availability: OnlineOnly
Author: Matthew Turk, Gang Hua
ISBN: 1608452417

Matthew Turk, Gang Hua

出版商: Morgan & Claypool
出版日期: 2013-10-01
售價: $1,910
貴賓價: 9.5 折 $1,815
語言: 英文
頁數: 134
裝訂: Paperback
ISBN: 1608452417
ISBN-13: 9781608452415
相關分類: Computer Vision

海外代購書籍(需單獨結帳)

商品描述

In its early years, the field of computer vision was largely motivated by researchers seeking computational models of biological vision and solutions to practical problems in manufacturing, defense, and medicine. For the past two decades or so, there has been an increasing interest in computer vision as an input modality in the context of human-computer interaction. Such vision-based interaction can endow interactive systems with visual capabilities similar to those important to human-human interaction, in order to perceive non-verbal cues and incorporate this information in applications such as interactive gaming, visualization, art installations, intelligent agent interaction, and various kinds of command and control tasks. Enabling this kind of rich, visual and multimodal interaction requires interactive-time solutions to problems such as detecting and recognizing faces and facial expressions, determining a person's direction of gaze and focus of attention, tracking movement of the body, and recognizing various kinds of gestures. In building technologies for vision-based interaction, there are choices to be made as to the range of possible sensors employed (e.g., single camera, stereo rig, depth camera), the precision and granularity of the desired outputs, the mobility of the solution, usability issues, etc. Practical considerations dictate that there is not a one-size-fits-all solution to the variety of interaction scenarios; however, there are principles and methodological approaches common to a wide range of problems in the domain. While new sensors such as the Microsoft Kinect are having a major influence on the research and practice of vision-based interaction in various settings, they are just a starting point for continued progress in the area. In this book, we discuss the landscape of history, opportunities, and challenges in this area of vision-based interaction; we review the state-of-the-art and seminal works in detecting and recognizing the human body and its components; we explore both static and dynamic approaches to ""looking at people"" vision problems; and we place the computer vision work in the context of other modalities and multimodal applications. Readers should gain a thorough understanding of current and future possibilities of computer vision technologies in the context of human-computer interaction.

商品描述(中文翻譯)

在早期，計算機視覺領域主要受到研究者尋求生物視覺的計算模型以及解決製造、國防和醫療等實際問題的驅動。在過去的二十年左右，隨著人機互動的背景下，計算機視覺作為一種輸入方式的興趣日益增加。這種基於視覺的互動可以賦予互動系統類似於人與人之間互動的重要視覺能力，以便感知非語言線索並將這些信息融入到互動遊戲、可視化、藝術裝置、智能代理互動以及各種指揮和控制任務等應用中。實現這種豐富的視覺和多模態互動需要在互動時間內解決一些問題，例如檢測和識別面孔及面部表情、確定一個人的注視方向和注意力焦點、追蹤身體運動以及識別各種手勢。在構建基於視覺的互動技術時，需要在所使用的傳感器範圍（例如，單鏡頭、立體攝影機、深度攝影機）、所需輸出的精確度和粒度、解決方案的移動性、可用性問題等方面做出選擇。實際考量表明，對於各種互動場景並不存在一種通用的解決方案；然而，在該領域的廣泛問題中，有一些共同的原則和方法論方法。雖然像 Microsoft Kinect 這樣的新型傳感器對於各種環境中的基於視覺的互動研究和實踐產生了重大影響，但它們僅僅是該領域持續進步的起點。在本書中，我們討論了基於視覺的互動領域的歷史背景、機會和挑戰；我們回顧了檢測和識別人體及其組成部分的最新技術和開創性工作；我們探討了靜態和動態的“觀察人類”視覺問題的方法；並將計算機視覺的工作置於其他模態和多模態應用的背景中。讀者應該能夠全面了解計算機視覺技術在人體互動中的當前和未來可能性。