readme.txt

来自「approximate reinforcement learning」· 文本 代码 · 共 8 行

TXT
8
字号
/int_mdp, /int_problem
	Problem and MDP definition of a simple discrete-time integrator
	 for approximate RL
/int_optimal
	RL solution: optimal Q-function and policy
/int_plotoptimal
	Script to plot a nice picture of the Q-function and lay policy on
	top of it.

⌨️ 快捷键说明

复制代码Ctrl + C
搜索代码Ctrl + F
全屏模式F11
增大字号Ctrl + =
减小字号Ctrl + -
显示快捷键?