Note: “The future is independent of the past given the present.”
Markov decision processes
Almost all reinforcement learning (RL) problems can be formalised as Markov decision processes (MDPs).
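To make the formalisation concrete, here is a minimal sketch of an MDP as the usual tuple of states, actions, transition probabilities, rewards, and a discount factor. The class name `MDP`, the `step` helper, and the two-state `toy` example are all hypothetical and chosen only for illustration; they are not taken from the linked lectures. Note that `step` reads nothing but the current state and action, which is the Markov property quoted above.

```python
import random
from dataclasses import dataclass


@dataclass
class MDP:
    """Hypothetical container for the tuple (S, A, P, R, gamma)."""
    states: tuple
    actions: tuple
    transitions: dict  # (state, action) -> list of (next_state, probability)
    rewards: dict      # (state, action) -> immediate reward
    gamma: float       # discount factor in [0, 1]

    def step(self, state, action, rng):
        """Sample (next_state, reward) from the current state and action only."""
        next_states, probs = zip(*self.transitions[(state, action)])
        next_state = rng.choices(next_states, weights=probs, k=1)[0]
        return next_state, self.rewards[(state, action)]


# Illustrative two-state example; the numbers are made up.
toy = MDP(
    states=("sleep", "study"),
    actions=("rest", "work"),
    transitions={
        ("sleep", "rest"): [("sleep", 1.0)],
        ("sleep", "work"): [("study", 0.9), ("sleep", 0.1)],
        ("study", "rest"): [("sleep", 1.0)],
        ("study", "work"): [("study", 1.0)],
    },
    rewards={
        ("sleep", "rest"): 0.0,
        ("sleep", "work"): -1.0,
        ("study", "rest"): 0.0,
        ("study", "work"): 2.0,
    },
    gamma=0.9,
)

if __name__ == "__main__":
    rng = random.Random(0)
    state = "sleep"
    for _ in range(5):
        # The history of earlier states is never consulted.
        state, reward = toy.step(state, "work", rng)
        print(state, reward)
```

Storing transitions as a dictionary keyed by `(state, action)` keeps the sketch readable for small problems; a larger problem would more likely use arrays.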
Useful Links
[Markov Decision Processes - UCL Computer Science (EN)]
[David Silver's Reinforcement Learning Course: Chinese Explanation and Practice (CN)]