📄 http:^^www.cs.wisc.edu^~finton^rlpage.html
字号:
Date: Tue, 05 Nov 1996 21:58:40 GMTServer: NCSA/1.5Content-type: text/htmlLast-modified: Fri, 25 Oct 1996 18:21:39 GMTContent-length: 2170<html><head><title>DJF's Reinforcement Learning Page</title></head><body><h1> Some Reinforcement Learning Resources</h1><p><h2> <!WA0><!WA0><!WA0><!WA0><!WA0><a href="http://www.cs.wisc.edu/~finton/djfpubs.html"> My publications</a></h2><p><h2> Short subjects:</h2><ul><li> <!WA1><!WA1><!WA1><!WA1><!WA1><a href="http://www.cs.wisc.edu/~finton/what-rl.html"> What is reinforcement learning, and why is it hard?</a><li> <!WA2><!WA2><!WA2><!WA2><!WA2><a href="http://www.cs.wisc.edu/~finton/ibfe.html"> What is Importance-Based Feature Extraction?</a></ul><p><h2> Simulation code for several control problems:</h2><ul><li> <!WA3><!WA3><!WA3><!WA3><!WA3><a href="http://www.cs.wisc.edu/~finton/poledriver.html"> Pole-cart problem, driver module</a>---Just the driver; supply your own controller<li> <!WA4><!WA4><!WA4><!WA4><!WA4><a href="http://www.cs.wisc.edu/~finton/qcontroller.html"> Sample Q-learning controller module</a>---Suitable for use with the pole-cart driver module. (Currently, doesn't use probabilistic action selection).<li> <!WA5><!WA5><!WA5><!WA5><!WA5><a href="ftp://ftp.cs.umass.edu/pub/anw/pub/sutton/pole.c"> Barto-Sutton-Anderson pole-cart solution</a><li> <!WA6><!WA6><!WA6><!WA6><!WA6><a href="http://www.cs.colostate.edu/~anderson/#software"> Chuck Anderson's public domain code for neural networks and reinforcement learning</a><li> <em>Suggestions on additional links?</em></ul><p><h2> Other RL resources:</h2><ul><li> <!WA7><!WA7><!WA7><!WA7><!WA7><a href="http://envy.cs.umass.edu/People/sutton/RLinterface/RLinterface.html"> Proposed Standard for Reinforcement Learning Software</a>, by Rich Sutton and Juan Carlos Santamaria<li> <!WA8><!WA8><!WA8><!WA8><!WA8><a href="ftp://archive.cis.ohio-state.edu/pub/neuroprose/"> NeuroProse Archive (Ohio State University)</a><li> <!WA9><!WA9><!WA9><!WA9><!WA9><a href="ftp://ftp.gmd.de/Learning/rl/"> GMD Reinforcement Learning Archive</a><li> <!WA10><!WA10><!WA10><!WA10><!WA10><a href="http://envy.cs.umass.edu/People/sutton/sutton.html"> Rich Sutton's</a> home page and RL archive<li> <!WA11><!WA11><!WA11><!WA11><!WA11><a href="http://www.idiap.ch/html/idiap-networks.html"> IDIAP Neural Network Home Page, including links to conferences</a></ul><p><hr><!WA12><!WA12><!WA12><!WA12><!WA12><a href="http://www.cs.wisc.edu/~finton/finton.html">David J. Finton</a>, <!WA13><!WA13><!WA13><!WA13><!WA13><a href="mailto:finton@cs.wisc.edu"><em>finton@cs.wisc.edu</em></a>, October 25, 1996.</body></html>
⌨️ 快捷键说明
复制代码
Ctrl + C
搜索代码
Ctrl + F
全屏模式
F11
切换主题
Ctrl + Shift + D
显示快捷键
?
增大字号
Ctrl + =
减小字号
Ctrl + -