int_mdp.m

来自「approximate reinforcement learning」· M 代码 · 共 22 行

22 行

function [xplus, rplus] = int_mdp(m, x, u)
% Implements the discrete-time dynamics of a simple integrator with
% bounded position, controlled in velocity.
%  [XPLUS, RPLUS] = DOUBLEINT_MDP(M, X, U)
%
% This function conforms to the specifications established by SAMPLE_MDP.

% 
% limit torque
u = max(-m.phys.maxu, min(m.phys.maxu, u));

% Compute and bound the next state
xplus = x + m.phys.K * u;
xplus = max(-m.phys.maxx, min(m.phys.maxx, xplus));

% Reward - QR
rplus = -x * m.goal.Q * x - u * m.goal.R * u + ...
    m.goal.zeroreward * (abs(x) <= m.goal.zeroband);


% END FUNCTION int_mpd() RETURNING xplus, rplus ==================================================

int_mdp.m - 源码说明

本页面展示了「approximate reinforcement learning」中的 int_mdp.m 源码文件，采用 M 编程语言编写，共 22 行代码。您可以在线阅读完整代码内容，也可以返回资源详情页下载完整源码包进行本地学习和开发。

虫虫下载站收录了大量与reinforcement相关的技术资源，包括源代码、技术文档、电路图等，是电子工程师和嵌入式开发者的专业学习平台。

⌨️ 快捷键说明

复制代码Ctrl + C

搜索代码Ctrl + F

全屏模式F11

增大字号Ctrl + =

减小字号Ctrl + -

显示快捷键?