Policy gradient formalism

video-placeholder
Loading...
查看授課大綱

審閱

4.2(431 個評分)
  • 5 stars
    58.46%
  • 4 stars
    22.96%
  • 3 stars
    9.04%
  • 2 stars
    4.17%
  • 1 star
    5.33%
SF
2020年4月8日

At times it felt like a bit more video material would be helpful to better understand the subject/gain deeper understanding.\n\nAnd fixing some of the notebooks would be helpful.

FZ
2019年2月13日

A great course with very practical assignments to help you learn how to implement RL algorithms. But it also has some stupid quiz questions which makes you feel confusing.

從本節課中
Policy-based methods
We spent 3 previous modules working on the value-based methods: learning state values, action values and whatnot. Now's the time to see an alternative approach that doesn't require you to predict all future rewards to learn something.

教學方

  • Placeholder

    Pavel Shvechikov

    Researcher at HSE and Sberbank AI Lab
  • Placeholder

    Alexander Panin

    Lecturer

探索我們的目錄

免費加入並獲得個性化推薦、更新和優惠。