An Algorithmic Perspective on Imitation Learning (Foundations and Trends(r) in Robotics)
暫譯: 模仿學習的演算法觀點(機器人學的基礎與趨勢)

Takayuki Osa, Joni Pajarinen, Gerhard Neumann

相關主題

商品描述

As robots and other intelligent agents move from simple environments and problems to more complex, unstructured settings, manually programming their behavior has become increasingly challenging and expensive. Often, it is easier for a teacher to demonstrate a desired behavior rather than attempt to manually engineer it. This process of learning from demonstrations, and the study of algorithms to do so, is called imitation learning.

An Algorithmic Perspective on Imitation Learning provides the reader with an introduction to imitation learning. It covers the underlying assumptions, approaches, and how they relate; the rich set of algorithms developed to tackle the problem; and advice on effective tools and implementation.

An Algorithmic Perspective on Imitation Learning serves two audiences. First, it familiarizes machine learning experts with the challenges of imitation learning, particularly those arising in robotics, and the interesting theoretical and practical distinctions between it and more familiar frameworks like statistical supervised learning theory and reinforcement learning. Second, it provides roboticists and experts in applied artificial intelligence with a broader appreciation for the frameworks and tools available for imitation learning. It pays particular attention to the intimate connection between imitation learning approaches and those of structured prediction.

商品描述(中文翻譯)

隨著機器人和其他智能代理從簡單的環境和問題轉向更複雜、非結構化的設置,手動編程它們的行為變得越來越具挑戰性和昂貴。通常,對於教師來說,展示所需的行為比試圖手動設計它更容易。這種從示範中學習的過程,以及研究實現此過程的算法,稱為模仿學習。

《模仿學習的算法視角》為讀者提供了模仿學習的介紹。它涵蓋了基本假設、方法及其相互關係;為解決該問題而開發的豐富算法集;以及有效工具和實施的建議。

《模仿學習的算法視角》服務於兩個受眾。首先,它使機器學習專家熟悉模仿學習的挑戰,特別是在機器人技術中出現的挑戰,以及它與更熟悉的框架(如統計監督學習理論和強化學習)之間有趣的理論和實踐區別。其次,它為機器人學家和應用人工智慧專家提供了對模仿學習可用框架和工具的更廣泛理解。它特別關注模仿學習方法與結構化預測方法之間的密切聯繫。