1. Motivating example2. The simplest MC-based RL algorithm3. Use data more efficiently4. MC without exploring starts参考文献本文是一篇学习笔记,内容全部源自于以下视频https://www.bilibili.com/video/BV1Pz5C6iE3X/?p=6&spm_id_from=333.1007.top_right_bar_window_history.content.click&vd_source=44ed90827c8f67247cab0ab288133c80