國立虎尾科技大學 |

Mathematical foundations of reinforcement learning

紀錄類型:	書目-語言資料,印刷品 : Monograph/item
正題名/作者:	Mathematical foundations of reinforcement learning/ by Shiyu Zhao.
作者:	Zhao, Shiyu.
出版者:	Singapore :Springer Nature Singapore : : 2025.,
面頁冊數:	xvi, 275 p. :ill., digital ; : 24 cm.;
Contained By:	Springer Nature eBook
標題:	Reinforcement learning. -
電子資源:	https://doi.org/10.1007/978-981-97-3944-8
ISBN:	9789819739448

Mathematical foundations of reinforcement learning
Zhao, Shiyu.

Mathematical foundations of reinforcement learning[electronic resource] /by Shiyu Zhao. - Singapore :Springer Nature Singapore :2025. - xvi, 275 p. :ill., digital ;24 cm.

1 Basic Concepts -- 2 State Value and Bellman Equation -- 3 Optimal State Value and Bellman Optimality Equation -- 4 Value Iteration and Policy Iteration -- 5 Monte Carlo Learning -- 6 Stochastic Approximation -- 7 Temporal-Difference Learning -- 8 Value Function Approximation -- 9 Policy Gradient -- 10 Actor-Critic Methods.

This book provides a mathematical yet accessible introduction to the fundamental concepts, core challenges, and classic reinforcement learning algorithms. It aims to help readers understand the theoretical foundations of algorithms, providing insights into their design and functionality. Numerous illustrative examples are included throughout. The mathematical content is carefully structured to ensure readability and approachability. The book is divided into two parts. The first part is on the mathematical foundations of reinforcement learning, covering topics such as the Bellman equation, Bellman optimality equation, and stochastic approximation. The second part explicates reinforcement learning algorithms, including value iteration and policy iteration, Monte Carlo methods, temporal-difference methods, value function methods, policy gradient methods, and actor-critic methods. With its comprehensive scope, the book will appeal to undergraduate and graduate students, post-doctoral researchers, lecturers, industrial researchers, and anyone interested in reinforcement learning.

ISBN: 9789819739448

Standard No.: 10.1007/978-981-97-3944-8doiSubjects--Topical Terms:

815404
Reinforcement learning.

LC Class. No.: Q325.6

Dewey Class. No.: 006.31

Mathematical foundations of reinforcement learning
LDR:02373nam a2200325 a 4500 001 1160600
003 DE-He213
005 20250122120756.0
006 m d
007 cr nn 008maaau
008 251029s2025 si s 0 eng d
020 $a 9789819739448 $q (electronic bk.)
020 $a 9789819739431 $q (paper)
024 7 $a 10.1007/978-981-97-3944-8 $2 doi
035 $a 978-981-97-3944-8
040 $a GP $c GP
041 0 $a eng
050 4 $a Q325.6
072 7 $a UYQ $2 bicssc
072 7 $a COM004000 $2 bisacsh
072 7 $a UYQ $2 thema
082 0 4 $a 006.31 $2 23
090 $a Q325.6 $b .Z63 2025
100 1 $a Zhao, Shiyu. $3 1487669
245 1 0 $a Mathematical foundations of reinforcement learning $h [electronic resource] / $c by Shiyu Zhao.
260 $a Singapore : $c 2025. $b Springer Nature Singapore : $b Imprint: Springer,
300 $a xvi, 275 p. : $b ill., digital ; $c 24 cm.
505 0 $a 1 Basic Concepts -- 2 State Value and Bellman Equation -- 3 Optimal State Value and Bellman Optimality Equation -- 4 Value Iteration and Policy Iteration -- 5 Monte Carlo Learning -- 6 Stochastic Approximation -- 7 Temporal-Difference Learning -- 8 Value Function Approximation -- 9 Policy Gradient -- 10 Actor-Critic Methods.
520 $a This book provides a mathematical yet accessible introduction to the fundamental concepts, core challenges, and classic reinforcement learning algorithms. It aims to help readers understand the theoretical foundations of algorithms, providing insights into their design and functionality. Numerous illustrative examples are included throughout. The mathematical content is carefully structured to ensure readability and approachability. The book is divided into two parts. The first part is on the mathematical foundations of reinforcement learning, covering topics such as the Bellman equation, Bellman optimality equation, and stochastic approximation. The second part explicates reinforcement learning algorithms, including value iteration and policy iteration, Monte Carlo methods, temporal-difference methods, value function methods, policy gradient methods, and actor-critic methods. With its comprehensive scope, the book will appeal to undergraduate and graduate students, post-doctoral researchers, lecturers, industrial researchers, and anyone interested in reinforcement learning.
650 0 $a Reinforcement learning. $3 815404
650 1 4 $a Artificial Intelligence. $3 646849
650 2 4 $a Machine Learning. $3 1137723
650 2 4 $a Data Science. $3 1174436
650 2 4 $a Multiagent Systems. $3 1228090
710 2 $a SpringerLink (Online service) $3 593884
773 0 $t Springer Nature eBook
856 4 0 $u https://doi.org/10.1007/978-981-97-3944-8
950 $a Computer Science (SpringerNature-11645)