Siraj Raval programs a virtual robot to do some house cleaning using a technique called Monte Carlo Prediction. In typical Siraj fashion he explains what it is, how it works and how to use it for reinforcement learning.
<a href="" title=""> <abbr title=""> <acronym title=""> <b> <blockquote cite=""> <cite> <code> <del datetime=""> <em> <i> <q cite=""> <s> <strike> <strong>
Notify me of follow-up comments by email.
Notify me of new posts by email.