Save for later

Sample-based Learning Methods

Reinforcement Learning,

In this course, you will learn about several algorithms that can learn near optimal policies based on trial and error interaction with the environment---learning from the agent’s own experience. Learning from actual experience is striking because it requires no prior knowledge of the environment’s dynamics, yet can still attain optimal behavior. We will cover intuitively simple but powerful Monte Carlo methods, and temporal difference learning methods including Q-learning. We will wrap up this course investigating how we can get the best of both worlds: algorithms that can combine model-based planning (similar to dynamic programming) and temporal difference updates to radically accelerate learning. By the end of this course you will be able to: - Understand Temporal-Difference learning and Monte Carlo as two strategies for estimating value functions from sampled experience - Understand the importance of exploration, when using sampled experience rather than dynamic programming sweeps within a model - Understand the connections between Monte Carlo and Dynamic Programming and TD. - Implement and apply the TD algorithm, for estimating value functions - Implement and apply Expected Sarsa and Q-learning (two TD methods for control) - Understand the difference between on-policy and off-policy control - Understand planning with simulated experience (as opposed to classic planning strategies) - Implement a model-based approach to RL, called Dyna, which uses simulated experience - Conduct an empirical study to see the improvements in sample efficiency when using Dyna

Get Details and Enroll Now

OpenCourser is an affiliate partner of Coursera and may earn a commission when you buy through our links.

Get a Reminder

Send to:
Rating 4.6 based on 58 ratings
Length 5 weeks
Effort 4-6 hours/week
Starts Jul 3 (44 weeks ago)
Cost $99
From University of Alberta, Alberta Machine Intelligence Institute via Coursera
Instructors Martha White, Adam White
Download Videos On all desktop and mobile devices
Language English
Subjects Programming Data Science
Tags Computer Science Data Science Algorithms Machine Learning

Get a Reminder

Send to:

Similar Courses

What people are saying

balance between theory

A great step towards the acquisition of basic and medium complexity RL concepts with a nice balance between theory and practice, similar to the first one.

Great balance between theory and demonstration of how all techniques works.

Read more

theory and practice

Well done mix of theory and practice!

programming assignments

The programming assignments are not really "programming" because you're constrained to type a handful of lines in a few places into a solution that is largely has been prepared for you.

Coursera has classes with more intense and creative programming assignments and the learning there seems to be much deeper.

excellent complement to the book Great The course needs more support and / or error message output for the programming assignments.

Also, the programming assignments are very beneficial.

Very good introductions and practices to the classic RL algorithms Excellent course companion to the textbook, clarifies many of the vague topics and gives good tests to ensure understanding Good balance of theory and programming assignments.

Rating 4.3 stars – so far (first two classes combined) Lectures: 4.0stars Quizes: 4.0stars Programming assignments: 4.5stars Book (Sutton and Barto): 4.5stars In the spectrum from the theoretical to practical where you have, very roughly,... (1) “Why”: Why you are doing what you are doing (2) “What”: What you are doing (3) “How”: How to implement it (eg programming)... ...this is a “what-how” class.

Read more

Careers

An overview of related careers and their average salaries in the US. Bars indicate income percentile.

Learning Coach/Mentor $51k

Living-Learning Communities Coordinator $52k

Associate Americas Learning Specialist $52k

Adult Learning AmeriCorps Member $55k

Computer Vision, Deep Learning Engineer $67k

Program Manager, Learning and Professional Development $68k

Team Learning & Development Specialist Lead $68k

Learning and Development Corporate Trainer $79k

HR Learning and Development/Training Instructor $87k

Senior Learning Specialist, Learning and Development $102k

Regional Learning Solutions Analyst Consultant $109k

Principal Learning Leader $176k

Write a review

Your opinion matters. Tell us what you think.

Rating 4.6 based on 58 ratings
Length 5 weeks
Effort 4-6 hours/week
Starts Jul 3 (44 weeks ago)
Cost $99
From University of Alberta, Alberta Machine Intelligence Institute via Coursera
Instructors Martha White, Adam White
Download Videos On all desktop and mobile devices
Language English
Subjects Programming Data Science
Tags Computer Science Data Science Algorithms Machine Learning

Similar Courses

Sorted by relevance

Like this course?

Here's what to do next:

  • Save this course for later
  • Get more details from the course provider
  • Enroll in this course
Enroll Now