Approximate Dynamic Programming: Solving the Curses of Dimensionality, 2/e (Hardcover)
暫譯: 近似動態規劃:解決維度詛咒,第2版 (精裝本)

Warren B. Powell

  • 出版商: Wiley
  • 出版日期: 2011-09-27
  • 售價: $5,270
  • 貴賓價: 9.5$5,007
  • 語言: 英文
  • 頁數: 606
  • 裝訂: Hardcover
  • ISBN: 047060445X
  • ISBN-13: 9780470604458
  • 海外代購書籍(需單獨結帳)

買這商品的人也買了...

相關主題

商品描述

Praise for the First Edition

"Finally, a book devoted to dynamic programming and written using the language of operations research (OR)! This beautiful book fills a gap in the libraries of OR specialists and practitioners."
Computing Reviews

This new edition showcases a focus on modeling and computation for complex classes of approximate dynamic programming problems

Understanding approximate dynamic programming (ADP) is vital in order to develop practical and high-quality solutions to complex industrial problems, particularly when those problems involve making decisions in the presence of uncertainty. Approximate Dynamic Programming, Second Edition uniquely integrates four distinct disciplines—Markov decision processes, mathematical programming, simulation, and statistics—to demonstrate how to successfully approach, model, and solve a wide range of real-life problems using ADP.

The book continues to bridge the gap between computer science, simulation, and operations research and now adopts the notation and vocabulary of reinforcement learning as well as stochastic search and simulation optimization. The author outlines the essential algorithms that serve as a starting point in the design of practical solutions for real problems. The three curses of dimensionality that impact complex problems are introduced and detailed coverage of implementation challenges is provided. The Second Edition also features:

  • A new chapter describing four fundamental classes of policies for working with diverse stochastic optimization problems: myopic policies, look-ahead policies, policy function approximations, and policies based on value function approximations

  • A new chapter on policy search that brings together stochastic search and simulation optimization concepts and introduces a new class of optimal learning strategies

  • Updated coverage of the exploration exploitation problem in ADP, now including a recently developed method for doing active learning in the presence of a physical state, using the concept of the knowledge gradient

  • A new sequence of chapters describing statistical methods for approximating value functions, estimating the value of a fixed policy, and value function approximation while searching for optimal policies

The presented coverage of ADP emphasizes models and algorithms, focusing on related applications and computation while also discussing the theoretical side of the topic that explores proofs of convergence and rate of convergence. A related website features an ongoing discussion of the evolving fields of approximation dynamic programming and reinforcement learning, along with additional readings, software, and datasets.

Requiring only a basic understanding of statistics and probability, Approximate Dynamic Programming, Second Edition is an excellent book for industrial engineering and operations research courses at the upper-undergraduate and graduate levels. It also serves as a valuable reference for researchers and professionals who utilize dynamic programming, stochastic programming, and control theory to solve problems in their everyday work.

商品描述(中文翻譯)

對於第一版的讚譽

"終於有一本專注於動態規劃並使用運籌學(OR)語言撰寫的書籍!這本美麗的書填補了運籌學專家和從業者圖書館中的一個空白。"

Computing Reviews

這一新版專注於複雜類別的近似動態規劃問題的建模和計算

理解近似動態規劃(ADP)對於開發實用且高品質的解決方案以應對複雜的工業問題至關重要,特別是當這些問題涉及在不確定性下做出決策時。近似動態規劃,第二版獨特地整合了四個不同的學科——馬可夫決策過程、數學規劃、模擬和統計——以展示如何成功地接近、建模和解決各種現實問題,使用ADP。

本書繼續彌合計算機科學、模擬和運籌學之間的鴻溝,並現在採用強化學習、隨機搜索和模擬優化的符號和詞彙。作者概述了作為設計實際解決方案起點的基本算法。介紹了影響複雜問題的三個維度詛咒,並詳細說明了實施挑戰。第二版還包含:



  • 一章新內容描述了四種基本政策類別,用於處理多樣的隨機優化問題:短視政策、前瞻政策、政策函數近似和基於價值函數近似的政策




  • 一章新內容關於政策搜索,將隨機搜索和模擬優化概念結合在一起,並介紹了一類新的最佳學習策略




  • 更新了ADP中探索與利用問題的內容,現在包括一種最近開發的方法,用於在物理狀態下進行主動學習,使用知識梯度的概念




  • 一系列新章節描述了近似價值函數的統計方法、估計固定政策的價值以及在尋找最佳政策時的價值函數近似



所呈現的ADP內容強調模型和算法,專注於相關應用和計算,同時也討論了該主題的理論側面,探索收斂性和收斂速率的證明。相關網站提供了對近似動態規劃和強化學習不斷演變領域的持續討論,以及額外的閱讀資料、軟體和數據集。

只需對統計和概率有基本了解,近似動態規劃,第二版是工業工程和運籌學課程的優秀書籍,適合高年級本科生和研究生使用。它也為利用動態規劃、隨機規劃和控制理論解決日常工作中問題的研究人員和專業人士提供了寶貴的參考。