Reinforcement Learning: Theory and Python Implementation

Xiao, Zhiqing

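  • Publisher: Springer
  • Publication date: 2024-09-29
  • List price: $3,490
  • VIP price: $3,316 (5% off)
  • Language: English
  • Pages: 559
  • Binding: Hardcover - also called cloth, retail trade, or trade
  • ISBN: 9811949328
  • ISBN-13: 9789811949326
  • Related categories: Python Programming Language, Reinforcement, DeepLearning
  • Overseas purchase-on-request title (must be checked out separately)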

Product description

Reinforcement Learning: Theory and Python Implementation is a tutorial book on reinforcement learning that explains both theory and applications. Starting from a unified mathematical framework, the book derives the theory of modern reinforcement learning systematically and introduces all mainstream reinforcement learning algorithms, from classical techniques such as eligibility traces to deep reinforcement learning algorithms such as PPO, SAC, and MuZero. Every chapter is accompanied by high-quality implementations based on the latest versions of Python packages such as Gym, and the deep reinforcement learning algorithms are implemented in both TensorFlow 2 and PyTorch 1. All code is available on GitHub along with its results and runs on a conventional laptop with Windows, macOS, or Linux.

This book is intended for readers who want to learn reinforcement learning systematically and apply it to practical problems. It is also well suited to academic researchers who seek a theoretical foundation or algorithmic enhancements for their cutting-edge AI research.

About the author

Zhiqing Xiao obtained his doctoral degree from Tsinghua University in 2016 and has more than 15 years of experience in academic research and industrial practice in data analytics and AI. He is the author of two AI bestsellers in Chinese, "Reinforcement Learning" and "Application of Neural Network and PyTorch", and has published many academic papers. He has also contributed to recent versions of the open-source software Gym.

