Mathematical foundations of reinforcement learning
Record type:
Bibliographic - language material, printed : Monograph/item
Title/Author:
Mathematical foundations of reinforcement learning / by Shiyu Zhao.
Author:
Zhao, Shiyu.
Publisher:
Singapore : Springer Nature Singapore, 2025.
Physical description:
xvi, 275 p. : ill., digital ; 24 cm.
Contained By:
Springer Nature eBook
Subject:
Multiagent Systems.
Electronic resource:
https://doi.org/10.1007/978-981-97-3944-8
ISBN:
9789819739448
Mathematical foundations of reinforcement learning
Zhao, Shiyu.
Mathematical foundations of reinforcement learning [electronic resource] / by Shiyu Zhao. - Singapore : Springer Nature Singapore, 2025. - xvi, 275 p. : ill., digital ; 24 cm.
1 Basic Concepts -- 2 State Value and Bellman Equation -- 3 Optimal State Value and Bellman Optimality Equation -- 4 Value Iteration and Policy Iteration -- 5 Monte Carlo Learning -- 6 Stochastic Approximation -- 7 Temporal-Difference Learning -- 8 Value Function Approximation -- 9 Policy Gradient -- 10 Actor-Critic Methods.
This book provides a mathematical yet accessible introduction to the fundamental concepts, core challenges, and classic reinforcement learning algorithms. It aims to help readers understand the theoretical foundations of algorithms, providing insights into their design and functionality. Numerous illustrative examples are included throughout. The mathematical content is carefully structured to ensure readability and approachability. The book is divided into two parts. The first part is on the mathematical foundations of reinforcement learning, covering topics such as the Bellman equation, Bellman optimality equation, and stochastic approximation. The second part explicates reinforcement learning algorithms, including value iteration and policy iteration, Monte Carlo methods, temporal-difference methods, value function methods, policy gradient methods, and actor-critic methods. With its comprehensive scope, the book will appeal to undergraduate and graduate students, post-doctoral researchers, lecturers, industrial researchers, and anyone interested in reinforcement learning.
ISBN: 9789819739448
Standard No.: 10.1007/978-981-97-3944-8 (doi)
Subjects--Topical Terms:
1228090
Multiagent Systems.
LC Class. No.: Q325.6
Dewey Class. No.: 006.31
LDR
02373nam a2200325 a 4500
001
1160600
003
DE-He213
005
20250122120756.0
006
m d
007
cr nn 008maaau
008
251029s2025 si s 0 eng d
020
$a
9789819739448
$q
(electronic bk.)
020
$a
9789819739431
$q
(paper)
024
7
$a
10.1007/978-981-97-3944-8
$2
doi
035
$a
978-981-97-3944-8
040
$a
GP
$c
GP
041
0
$a
eng
050
4
$a
Q325.6
072
7
$a
UYQ
$2
bicssc
072
7
$a
COM004000
$2
bisacsh
072
7
$a
UYQ
$2
thema
082
0 4
$a
006.31
$2
23
090
$a
Q325.6
$b
.Z63 2025
100
1
$a
Zhao, Shiyu.
$3
1487669
245
1 0
$a
Mathematical foundations of reinforcement learning
$h
[electronic resource] /
$c
by Shiyu Zhao.
260
$a
Singapore :
$b
Springer Nature Singapore :
$b
Imprint: Springer,
$c
2025.
300
$a
xvi, 275 p. :
$b
ill., digital ;
$c
24 cm.
505
0
$a
1 Basic Concepts -- 2 State Value and Bellman Equation -- 3 Optimal State Value and Bellman Optimality Equation -- 4 Value Iteration and Policy Iteration -- 5 Monte Carlo Learning -- 6 Stochastic Approximation -- 7 Temporal-Difference Learning -- 8 Value Function Approximation -- 9 Policy Gradient -- 10 Actor-Critic Methods.
520
$a
This book provides a mathematical yet accessible introduction to the fundamental concepts, core challenges, and classic reinforcement learning algorithms. It aims to help readers understand the theoretical foundations of algorithms, providing insights into their design and functionality. Numerous illustrative examples are included throughout. The mathematical content is carefully structured to ensure readability and approachability. The book is divided into two parts. The first part is on the mathematical foundations of reinforcement learning, covering topics such as the Bellman equation, Bellman optimality equation, and stochastic approximation. The second part explicates reinforcement learning algorithms, including value iteration and policy iteration, Monte Carlo methods, temporal-difference methods, value function methods, policy gradient methods, and actor-critic methods. With its comprehensive scope, the book will appeal to undergraduate and graduate students, post-doctoral researchers, lecturers, industrial researchers, and anyone interested in reinforcement learning.
650
2 4
$a
Multiagent Systems.
$3
1228090
650
2 4
$a
Data Science.
$3
1174436
650
2 4
$a
Machine Learning.
$3
1137723
650
1 4
$a
Artificial Intelligence.
$3
646849
650
0
$a
Reinforcement learning.
$3
815404
710
2
$a
SpringerLink (Online service)
$3
593884
773
0
$t
Springer Nature eBook
856
4 0
$u
https://doi.org/10.1007/978-981-97-3944-8
950
$a
Computer Science (SpringerNature-11645)