POMDP Example 1 - Detailed Analysis
This video is part of the Udacity course "Reinforcement Learning". A brief introduction to Partially Observable Markov Decision Processes (POMDPs). The location of the can is unobservable. There is a 0.6 chance that the can is at the corridor (world
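Because the can's location cannot be observed directly, the agent has to work with a belief, that is, a probability distribution over the places the can might be. The sketch below is only an illustration under stated assumptions: the 0.6 figure for the corridor comes from the description, while the two-location setup, the name `elsewhere`, the sensor likelihoods, and the helper `update_belief` are all hypothetical and not taken from the video.

```python
# Belief over the can's location.
# Only the 0.6 for the corridor comes from the description above.
# ASSUMPTION: exactly two possible locations, so the rest of the mass (0.4)
# goes to a single hypothetical alternative called "elsewhere".
belief = {"corridor": 0.6, "elsewhere": 0.4}


def update_belief(belief, likelihood):
    """Bayes rule: posterior(s) is proportional to likelihood(s) * prior(s)."""
    unnormalized = {s: likelihood[s] * p for s, p in belief.items()}
    total = sum(unnormalized.values())
    if total == 0:
        raise ValueError("observation has zero probability under this belief")
    return {s: v / total for s, v in unnormalized.items()}


# ASSUMPTION: a hypothetical noisy sensor reports "can seen in corridor".
# These are P(observation | true location), invented for illustration.
likelihood = {"corridor": 0.8, "elsewhere": 0.1}

posterior = update_belief(belief, likelihood)
print(posterior)
```

With these made-up sensor numbers, the belief that the can is at the corridor rises above the prior 0.6, which is the basic mechanism a POMDP agent uses to fold observations into its estimate of the hidden state.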