Handbook of Learning and Approximate Dynamic Programming
暫譯: 學習與近似動態規劃手冊

Name: Handbook of Learning and Approximate Dynamic Programming
Price: 6270 TWD
Availability: InStock
Author: Jennie Si, Andrew G. Barto, Warren Buckler Powell, Don Wunsch
ISBN: 047166054X

Jennie Si, Andrew G. Barto, Warren Buckler Powell, Don Wunsch

出版商: Wiley
出版日期: 2004-08-02
售價: $6,600
貴賓價: 9.5 折 $6,270
語言: 英文
頁數: 672
裝訂: Hardcover
ISBN: 047166054X
ISBN-13: 9780471660545
相關分類: Reinforcement

立即出貨 (庫存=1)

買這商品的人也買了...

~~$1,200~~ $1,176

Optical Networks: A Practical Perspective, 2/e
~~$650~~ $514

Visual C#.NET 程式設計經典
~~$1,950~~ $1,853

Writing Secure Code, 2/e (Paperback)
~~$780~~ $741

作業系統概念 (Operating System Concepts, 6/e Windows XP Update)
~~$980~~ $960

Network Systems Design Using Network Processors (Paperback)
~~$590~~ $466

ASP.NET 程式設計徹底研究
~~$490~~ $387

軟體設計與品質管理
~~$690~~ $538

STRUTS 實作手冊(Struts in Action: Building Web Applications with the Leading Java Framework)
$990

Beginning Visual C++ 6 (Paperback)
~~$550~~ $468

osCommerce 購物網站架設實戰
~~$490~~ $417

Dreamweaver MX 2004 魔法書中文版
~~$4,540~~ $4,313

Handbook of Digital Techniques for High-Speed Design: Design Examples, Signaling and Memory Technologies, Fiber Optics, Modeling and Simulation to Ensure Signal Integrity
~~$1,200~~ $1,176

Computer Organization and Design: The Hardware/Software Interface, 3/e(IE) (美國版ISBN:1558606041)
~~$760~~ $646

Flash ActionScript 2.0 RIA 應用程式開發
~~$890~~ $703

Windows CE.NET 程式設計 (Programming Microsoft Windows CE .Net, 3/e)
~~$650~~ $553

Linux 指令詳解辭典
~~$650~~ $507

ASP.NET 徹底研究進階技巧─高階技巧與控制項實作
~~$1,700~~ $1,666

CCNA Cisco Certified Network Associate Study Guide, 5/e (640-801)
~~$880~~ $748

Head First Servlets & JSP：SCWCD 專業認證指南 (Head First Servlets & JSP)
~~$580~~ $493

打造個性化 XOOPS2 網站─佈景設計、模組開發
~~$2,500~~ $2,375

WiMAX Handbook
~~$420~~ $328

架設我的部落格王國－plog 建構網誌與像簿
~~$680~~ $537

PHP + MySQL 快速入門
~~$450~~ $356

Ajax 技術手冊 (Foundations of Ajax)
~~$490~~ $387

程式之美－微軟技術面試心得

商品描述

Description:

Approximate dynamic programming solves decision and control problems

While advances in science and engineering have enabled us to design and build complex systems, how to control and optimize them remains a challenge. This was made clear, for example, by the major power outage across dozens of cities in the Eastern United States and Canada in August of 2003. Learning and approximate dynamic programming (ADP) is emerging as one of the most promising mathematical and computational approaches to solve nonlinear, large-scale, dynamic control problems under uncertainty. It draws heavily both on rigorous mathematics and on biological inspiration and parallels, and helps unify new developments across many disciplines.

The foundations of learning and approximate dynamic programming have evolved from several fields–optimal control, artificial intelligence (reinforcement learning), operations research (dynamic programming), and stochastic approximation methods (neural networks). Applications of these methods span engineering, economics, business, and computer science. In this volume, leading experts in the field summarize the latest research in areas including:

Reinforcement learning and its relationship to supervised learning

Model-based adaptive critic designs

Direct neural dynamic programming

Hierarchical decision-making

Multistage stochastic linear programming for resource allocation problems

Concurrency, multiagency, and partial observability

Backpropagation through time and derivative adaptive critics

Applications of approximate dynamic programming and reinforcement learning in control-constrained agile missiles; power systems; heating, ventilation, and air conditioning; helicopter flight control; transportation and more.

商品描述(中文翻譯)

描述：

近似動態規劃解決決策和控制問題

隨著科學和工程的進步，我們能夠設計和建造複雜系統，但如何控制和優化這些系統仍然是一個挑戰。例如，2003年8月美國東部和加拿大數十個城市的大規模停電事件就清楚地顯示了這一點。學習和近似動態規劃（ADP）正逐漸成為解決不確定性下的非線性、大規模動態控制問題的最有前景的數學和計算方法之一。它在嚴謹的數學基礎和生物靈感及其相似性上都有很大的依賴，並有助於統一各個學科的新發展。

學習和近似動態規劃的基礎源自幾個領域——最優控制、人工智慧（強化學習）、運籌學（動態規劃）和隨機逼近方法（神經網絡）。這些方法的應用涵蓋了工程、經濟學、商業和計算機科學。在本書中，該領域的領先專家總結了最新的研究，涵蓋以下領域：

- 強化學習及其與監督學習的關係
- 基於模型的自適應評價設計
- 直接神經動態規劃
- 階層決策
- 用於資源分配問題的多階段隨機線性規劃
- 並行性、多代理和部分可觀察性
- 時間反向傳播和導數自適應評價
- 近似動態規劃和強化學習在控制受限的敏捷導彈、電力系統、暖通空調、直升機飛行控制、交通運輸等方面的應用。