Handbook of Learning and Approximate Dynamic Programming

Jennie Si, Andrew G. Barto, Warren Buckler Powell, Don Wunsch

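  • Publisher: Wiley
  • Publication date: 2004-08-02
  • List price: $6,600
  • Sale price: $6,270 (5% off)
  • Language: English
  • Pages: 672
  • Binding: Hardcover
  • ISBN: 047166054X
  • ISBN-13: 9780471660545
  • Related categories: Artificial Intelligence, Control Systems
  • Ships immediately (stock = 1)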

Product Description

Description:

Approximate dynamic programming solves decision and control problems

While advances in science and engineering have enabled us to design and build complex systems, how to control and optimize them remains a challenge. This was made clear, for example, by the major power outage across dozens of cities in the Eastern United States and Canada in August of 2003. Learning and approximate dynamic programming (ADP) is emerging as one of the most promising mathematical and computational approaches to solve nonlinear, large-scale, dynamic control problems under uncertainty. It draws heavily both on rigorous mathematics and on biological inspiration and parallels, and helps unify new developments across many disciplines.

The foundations of learning and approximate dynamic programming have evolved from several fields: optimal control, artificial intelligence (reinforcement learning), operations research (dynamic programming), and stochastic approximation methods (neural networks). Applications of these methods span engineering, economics, business, and computer science. In this volume, leading experts in the field summarize the latest research in areas including:

  • Reinforcement learning and its relationship to supervised learning
  • Model-based adaptive critic designs
  • Direct neural dynamic programming
  • Hierarchical decision-making
  • Multistage stochastic linear programming for resource allocation problems
  • Concurrency, multiagency, and partial observability
  • Backpropagation through time and derivative adaptive critics
  • Applications of approximate dynamic programming and reinforcement learning in control-constrained agile missiles; power systems; heating, ventilation, and air conditioning; helicopter flight control; transportation and more.
