readme.txt
来自「approximate reinforcement learning」· 文本 代码 · 共 8 行
TXT
8 行
/int_mdp, /int_problem
Problem and MDP definition of a simple discrete-time integrator
for approximate RL
/int_optimal
RL solution: optimal Q-function and policy
/int_plotoptimal
Script to plot a nice picture of the Q-function and lay policy on
top of it.
⌨️ 快捷键说明
复制代码Ctrl + C
搜索代码Ctrl + F
全屏模式F11
增大字号Ctrl + =
减小字号Ctrl + -
显示快捷键?