課程信息
76,715 次近期查看

第 2 門課程(共 4 門)

100% 在線

立即開始,按照自己的計劃學習。

可靈活調整截止日期

根據您的日程表重置截止日期。

中級

Probabilities & Expectations, basic linear algebra, basic calculus, Python 3.0 (at least 1 year), implementing algorithms from pseudocode

完成時間大約為20 小時

建議:4-6 hours/week...

英語(English)

字幕:英語(English)

您將獲得的技能

Artificial Intelligence (AI)Machine LearningReinforcement LearningFunction ApproximationIntelligent Systems

第 2 門課程(共 4 門)

100% 在線

立即開始,按照自己的計劃學習。

可靈活調整截止日期

根據您的日程表重置截止日期。

中級

Probabilities & Expectations, basic linear algebra, basic calculus, Python 3.0 (at least 1 year), implementing algorithms from pseudocode

完成時間大約為20 小時

建議:4-6 hours/week...

英語(English)

字幕:英語(English)

教學大綱 - 您將從這門課程中學到什麼

1
完成時間為 1 小時

Welcome to the Course!

2 個視頻 (總計 10 分鐘), 2 個閱讀材料
2 個視頻
Meet your instructors!8分鐘
2 個閱讀材料
Reinforcement Learning Textbook10分鐘
Read Me: Pre-requisites and Learning Objectives10分鐘
2
完成時間為 3 小時

Monte Carlo Methods for Prediction & Control

11 個視頻 (總計 58 分鐘), 2 個閱讀材料, 1 個測驗
11 個視頻
Using Monte Carlo for Prediction6分鐘
Using Monte Carlo for Action Values2分鐘
Using Monte Carlo methods for generalized policy iteration2分鐘
Solving the Blackjack Example3分鐘
Epsilon-soft policies5分鐘
Why does off-policy learning matter?4分鐘
Importance Sampling4分鐘
Off-Policy Monte Carlo Prediction5分鐘
Emma Brunskill: Batch Reinforcement Learning12分鐘
Week 1 Summary3分鐘
2 個閱讀材料
Weekly Reading40分鐘
Chapter Summary40分鐘
1 個練習
Graded Quiz
3
完成時間為 6 小時

Temporal Difference Learning Methods for Prediction

6 個視頻 (總計 37 分鐘), 1 個閱讀材料, 2 個測驗
6 個視頻
Rich Sutton: The Importance of TD Learning6分鐘
The advantages of temporal difference learning5分鐘
Comparing TD and Monte Carlo5分鐘
Andy Barto and Rich Sutton: More on the History of RL12分鐘
Week 2 Summary2分鐘
1 個閱讀材料
Weekly Reading40分鐘
1 個練習
Practice Quiz30分鐘
4
完成時間為 8 小時

Temporal Difference Learning Methods for Control

9 個視頻 (總計 30 分鐘), 2 個閱讀材料, 2 個測驗
9 個視頻
Sarsa in the Windy Grid World3分鐘
What is Q-learning?3分鐘
Q-learning in the Windy Grid World3分鐘
How is Q-learning off-policy?4分鐘
Expected Sarsa3分鐘
Expected Sarsa in the Cliff World3分鐘
Generality of Expected Sarsa1分鐘
Week 3 Summary2分鐘
2 個閱讀材料
Weekly Reading40分鐘
Chapter summary40分鐘
1 個練習
Practice Quiz18分鐘
4.8
39 條評論Chevron Right

來自Sample-based Learning Methods的熱門評論

創建者 KNOct 3rd 2019

Great course! The notebooks are a perfect level of difficulty for someone learning RL for the first time. Thanks Martha and Adam for all your work on this!! Great content!!

創建者 UZNov 23rd 2019

Good balance of theory and programming assignments. I really like the weekly bonus videos with professors and developers. Recommend to everyone.

講師

Avatar

Martha White

Assistant Professor
Computing Science
Avatar

Adam White

Assistant Professor
Computing Science

關於 阿尔伯塔大学

UAlberta is considered among the world’s leading public research- and teaching-intensive universities. As one of Canada’s top universities, we’re known for excellence across the humanities, sciences, creative arts, business, engineering and health sciences....

關於 Alberta Machine Intelligence Institute

The Alberta Machine Intelligence Institute (Amii) is home to some of the world’s top talent in machine intelligence. We’re an Alberta-based research institute that pushes the bounds of academic knowledge and guides business understanding of artificial intelligence and machine learning....

關於 强化学习 專項課程

The Reinforcement Learning Specialization consists of 4 courses exploring the power of adaptive learning systems and artificial intelligence (AI). Harnessing the full potential of artificial intelligence requires adaptive learning systems. Learn how Reinforcement Learning (RL) solutions help solve real-world problems through trial-and-error interaction by implementing a complete RL solution from beginning to end. By the end of this Specialization, learners will understand the foundations of much of modern probabilistic artificial intelligence (AI) and be prepared to take more advanced courses or to apply AI tools and ideas to real-world problems. This content will focus on “small-scale” problems in order to understand the foundations of Reinforcement Learning, as taught by world-renowned experts at the University of Alberta, Faculty of Science. The tools learned in this Specialization can be applied to game development (AI), customer interaction (how a website interacts with customers), smart assistants, recommender systems, supply chain, industrial control, finance, oil & gas pipelines, industrial control systems, and more....
强化学习

常見問題

  • 注册以便获得证书后,您将有权访问所有视频、测验和编程作业(如果适用)。只有在您的班次开课之后,才可以提交和审阅同学互评作业。如果您选择在不购买的情况下浏览课程,可能无法访问某些作业。

  • 您注册课程后,将有权访问专项课程中的所有课程,并且会在完成课程后获得证书。您的电子课程证书将添加到您的成就页中,您可以通过该页打印您的课程证书或将其添加到您的领英档案中。如果您只想阅读和查看课程内容,可以免费旁听课程。

還有其他問題嗎?請訪問 學生幫助中心