Reward design

Loading...
查看授課大綱

審閱

4.1(341 個評分)
  • 5 stars
    54%
  • 4 stars
    25%
  • 3 stars
    9%
  • 2 stars
    5%
  • 1 star
    7%
LJ

Oct 07, 2019

Challenging (unlike many other courses on Coursera, it does not baby you and does not seem to be targeting as high a pass rate as possible), but very very rewarding.

HH

Jan 29, 2020

Very practical lecture. I strongly recommend this lecture. Programming assignments are little difficult, but not impossible :) Just do it!

從本節課中
At the heart of RL: Dynamic Programming
This week we'll consider the reinforcement learning formalisms in a more rigorous, mathematical way. You'll learn how to effectively compute the return your agent gets for a particular action - and how to pick best actions based on that return.

教學方

  • Pavel Shvechikov

    Pavel Shvechikov

    Researcher at HSE and Sberbank AI Lab
  • Alexander Panin

    Alexander Panin

    Lecturer

探索我們的目錄

免費加入並獲得個性化推薦、更新和優惠。