Reward design

Reviews

4.1 (276 ratings)
  • 5 stars: 149 ratings
  • 4 stars: 69 ratings
  • 3 stars: 26 ratings
  • 2 stars: 12 ratings
  • 1 star: 20 ratings

LJ

Oct 07, 2019

Challenging (unlike many other courses on Coursera, it does not baby you and does not seem to be targeting as high a pass rate as possible), but very very rewarding.

VO

Mar 17, 2019

Well prepared and well taught course. Will highly recommend as the primer for reinforcement learning.

From the lesson
At the heart of RL: Dynamic Programming
This week we'll consider the reinforcement learning formalisms in a more rigorous, mathematical way. You'll learn how to effectively compute the return your agent gets for a particular action, and how to pick the best actions based on that return.
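
As a rough illustration of what "computing the return" and "picking actions based on that return" can mean, here is a minimal sketch. It assumes a discounted sum of rewards with a discount factor gamma and a greedy choice over per-action estimates; the function names, the reward sequences, and the value of gamma are hypothetical and are not taken from the course materials.

```python
def discounted_return(rewards, gamma=0.9):
    """Discounted sum of a reward sequence: r_0 + gamma*r_1 + gamma^2*r_2 + ...

    gamma is an assumed discount factor, not a value given by the course.
    """
    g = 0.0
    # Accumulate from the last reward backwards so each step applies one discount.
    for r in reversed(rewards):
        g = r + gamma * g
    return g


def greedy_action(action_values):
    """Return the action whose estimated return is highest."""
    return max(action_values, key=action_values.get)


# Hypothetical reward sequences observed after taking each action.
rewards_by_action = {
    "left": [0.0, 0.0, 1.0],
    "right": [1.0, 0.0, 0.0],
}

action_values = {
    action: discounted_return(rewards, gamma=0.5)
    for action, rewards in rewards_by_action.items()
}
best = greedy_action(action_values)
```

In this toy setup the later reward under "left" is discounted more heavily than the immediate reward under "right", so the greedy choice favors "right".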

Instructors

  • Pavel Shvechikov

    Researcher at HSE and Sberbank AI Lab
  • Alexander Panin

    Lecturer
