ECE586RS: MDPs and Reinforcement Learning