Martha White and Adam White

In this course, you will learn about several algorithms that can learn near-optimal policies through trial-and-error interaction with the environment, that is, by learning from the agent’s own experience. Learning from actual experience is striking because it requires no prior knowledge of the environment’s dynamics, yet can still attain optimal behavior. We will cover intuitively simple but powerful Monte Carlo methods, and temporal difference learning methods including Q-learning. We will wrap up the course by investigating how to get the best of both worlds: algorithms that combine model-based planning (similar to dynamic programming) with temporal difference updates to radically accelerate learning.

By the end of this course you will be able to:

- Understand Temporal-Difference learning and Monte Carlo as two strategies for estimating value functions from sampled experience

- Understand the importance of exploration when using sampled experience rather than dynamic programming sweeps within a model

- Understand the connections between Monte Carlo, Dynamic Programming, and TD

- Implement and apply the TD algorithm for estimating value functions

- Implement and apply Expected Sarsa and Q-learning (two TD methods for control)

- Understand the difference between on-policy and off-policy control

- Understand planning with simulated experience (as opposed to classic planning strategies)

- Implement a model-based approach to RL, called Dyna, which uses simulated experience

- Conduct an empirical study to see the improvements in sample efficiency when using Dyna
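
As a rough illustration of the TD objective in the list above, here is a minimal sketch of tabular TD(0) value estimation. The five-state random walk, the function name `td0_random_walk`, and all parameter values are assumptions chosen for illustration; they are not taken from the course materials.

```python
import random


def td0_random_walk(num_episodes=2000, alpha=0.05, gamma=1.0, seed=0):
    """Estimate state values for a hypothetical 5-state random walk with TD(0).

    The agent starts in the middle state and moves left or right with equal
    probability. Leaving on the right gives reward 1, leaving on the left
    gives reward 0, and every other transition gives reward 0.
    """
    rng = random.Random(seed)
    n_states = 5                   # non-terminal states are indexed 0..4
    values = [0.5] * n_states      # arbitrary initial estimates
    for _ in range(num_episodes):
        state = n_states // 2
        while True:
            next_state = state + rng.choice((-1, 1))
            if next_state < 0:                 # left terminal
                reward, next_value, done = 0.0, 0.0, True
            elif next_state >= n_states:       # right terminal
                reward, next_value, done = 1.0, 0.0, True
            else:
                reward, next_value, done = 0.0, values[next_state], False
            # TD(0): move V(s) toward the bootstrapped target r + gamma * V(s').
            td_error = reward + gamma * next_value - values[state]
            values[state] += alpha * td_error
            if done:
                break
            state = next_state
    return values


if __name__ == "__main__":
    print([round(v, 2) for v in td0_random_walk()])
```

For this particular toy problem the true values are 1/6, 2/6, ..., 5/6, so the estimates can be checked against them. A Monte Carlo variant would instead wait until the end of each episode and update toward the observed return.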

What's inside

Syllabus

Welcome to the Course!
Welcome to the second course in the Reinforcement Learning Specialization: Sample-Based Learning Methods, brought to you by the University of Alberta, Onlea, and Coursera. In this pre-course module, you'll be introduced to your instructors, and get a flavour of what the course has in store for you. Make sure to introduce yourself to your classmates in the "Meet and Greet" section!

Traffic lights

Read about what's good, what should give you pause, and possible dealbreakers
Examines model-based and model-free approaches to reinforcement learning
Teaches foundational concepts in RL, including TD learning, Monte Carlo methods, and Dyna
Provides hands-on experience implementing TD and Q-learning algorithms
Taught by experienced RL researchers with a strong track record in the field
Focuses on practical applications, with examples from simulated domains
Assumes some prior knowledge of RL concepts and Python programming

Reviews summary

Foundational sample-based reinforcement learning

According to learners, this course offers a positive and challenging exploration of sample-based reinforcement learning methods including Monte Carlo, Temporal Difference learning, Q-learning, and Dyna. Many students highlight the clear explanations that help build a strong theoretical foundation in these core algorithms. While the course is seen as demanding and requires significant effort, particularly with the coding assignments, reviewers consistently describe the assignments as essential for gaining practical insight and solidifying understanding. Overall, it's considered a valuable part of the specialization for those seeking a deep dive into the fundamentals.
Requires significant time and background.
"This course requires a lot of effort and dedication."
"It's a rigorous and challenging course; be prepared for a steep learning curve."
"Requires a solid understanding of prerequisite knowledge from the prior course."
"Not recommended if you are looking for an easy overview; it's a deep dive."
Assignments deepen understanding through implementation.
"Assignments are hard but they really make you understand the concepts deeply."
"I learned the most from tackling the coding assignments; they make the theory concrete."
"Requires significant effort on the assignments, but it's absolutely essential."
"The assignments are very challenging but ultimately rewarding and key to solidifying learning."
Concepts explained clearly and in depth.
"Provides a really solid foundation on TD and MC methods from first principles..."
"The concepts are explained clearly and build well on prior material."
"I got a deep understanding of the core algorithms discussed."
"Excellent explanation of theory behind the algorithms like Q-learning and Sarsa."

Activities

Be better prepared before your course. Deepen your understanding during and after it. Supplement your coursework and achieve mastery of the topics covered in Sample-based Learning Methods with these activities:
Review basic probability and statistics concepts
Refresh your knowledge of probability and statistics to strengthen your foundation for understanding reinforcement learning concepts.
  • Review probability distributions
  • Practice solving basic statistical problems
Review basic linear algebra concepts
Refresh your knowledge of linear algebra to enhance your understanding of the mathematical underpinnings of reinforcement learning.
  • Review vector spaces and matrices
  • Practice solving systems of linear equations
Participate in online discussion forums
Engage in discussions with peers to clarify concepts, share insights, and get feedback on your understanding of the course material.
  • Join online discussion forums
  • Participate in discussions by asking questions
  • Provide thoughtful responses to others' questions
Practice coding with Q-learning and Expected Sarsa
Practice implementing and applying temporal difference learning algorithms to better understand how they work and reinforce your understanding of the concepts.
  • Implement the Q-learning algorithm
  • Implement the Expected Sarsa algorithm
  • Test both algorithms on a simple environment
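
One possible way to approach the three steps above is sketched below. The corridor environment, the helper names (`step`, `epsilon_greedy_probs`, `train`), and the hyperparameters are all hypothetical and chosen only to keep the example small. The two methods differ only in the target: Q-learning bootstraps from the maximum next action value, while Expected Sarsa bootstraps from the expectation under the epsilon-greedy policy.

```python
import random

N_STATES = 5  # hypothetical corridor: states 0..4, exit to the right of state 4


def step(state, action):
    """Toy deterministic environment: action 0 moves left, action 1 moves right."""
    next_state = max(0, state - 1) if action == 0 else state + 1
    done = next_state == N_STATES
    return next_state, -1.0, done   # reward of -1 per step until the exit


def epsilon_greedy_probs(q_row, epsilon):
    """Action probabilities of an epsilon-greedy policy for one state."""
    n = len(q_row)
    best = max(q_row)
    greedy = [a for a, q in enumerate(q_row) if q == best]
    probs = [epsilon / n] * n
    for a in greedy:
        probs[a] += (1.0 - epsilon) / len(greedy)
    return probs


def train(method, episodes=300, alpha=0.5, gamma=1.0, epsilon=0.1, seed=0):
    """Run tabular control with method 'q_learning' or 'expected_sarsa'."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_STATES)]
    for _ in range(episodes):
        state, done = 0, False
        while not done:
            probs = epsilon_greedy_probs(q[state], epsilon)
            action = rng.choices(range(2), weights=probs)[0]
            next_state, reward, done = step(state, action)
            q_next = [0.0, 0.0] if done else q[next_state]
            if method == "q_learning":
                # Off-policy target: value of the greedy next action.
                bootstrap = max(q_next)
            else:
                # Expected Sarsa target: expectation under the behavior policy.
                next_probs = epsilon_greedy_probs(q_next, epsilon)
                bootstrap = sum(p * v for p, v in zip(next_probs, q_next))
            q[state][action] += alpha * (reward + gamma * bootstrap - q[state][action])
            state = next_state
    return q


if __name__ == "__main__":
    for method in ("q_learning", "expected_sarsa"):
        q = train(method)
        print(method, [row.index(max(row)) for row in q])
```

In this sketch both methods should end up preferring the "right" action in every state, although the learned values differ because Expected Sarsa accounts for the exploratory moves of the epsilon-greedy policy.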
Compile a list of resources for further learning
Gather and organize a collection of resources such as articles, videos, or websites that can supplement your learning and provide additional insights into the course topics.
  • Search for relevant resources
  • Organize and categorize the resources
  • Share the compilation with others
Explore applications of Monte Carlo and Temporal Difference Learning
Research and learn about practical applications of Monte Carlo and Temporal Difference Learning to gain a broader perspective on their usefulness and potential impact.
  • Review real-world applications of Monte Carlo methods
  • Explore case studies and examples of Temporal Difference Learning in practice
Create a simulation-based RL project
Develop a project that demonstrates your understanding of simulation-based RL techniques and showcases your ability to apply them to solve a real-world problem.
  • Define the problem and environment
  • Design and implement the simulation
  • Apply RL algorithms to the simulated environment
  • Evaluate and analyze the results
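
The four steps above could be prototyped along the following lines. This is a minimal sketch of a Dyna-Q style agent, assuming a hypothetical deterministic corridor environment; the names `step` and `dyna_q` and all parameter values are illustrative assumptions rather than anything prescribed by the course. After each real step the agent records the transition in a learned model and then replays randomly chosen remembered transitions as simulated experience.

```python
import random

N_STATES = 5  # hypothetical corridor: states 0..4, exit to the right of state 4


def step(state, action):
    """Toy deterministic environment: action 0 moves left, action 1 moves right."""
    next_state = max(0, state - 1) if action == 0 else state + 1
    done = next_state == N_STATES
    return next_state, -1.0, done


def dyna_q(episodes=30, planning_steps=10, alpha=0.5, gamma=1.0,
           epsilon=0.1, seed=0):
    """Tabular Dyna-Q sketch; returns the action values and the real step count."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_STATES)]
    model = {}        # (state, action) -> (reward, next_state, done)
    real_steps = 0

    def update(s, a, r, s2, done):
        # Q-learning update, shared by real and simulated experience.
        target = r if done else r + gamma * max(q[s2])
        q[s][a] += alpha * (target - q[s][a])

    for _ in range(episodes):
        state, done = 0, False
        while not done:
            if rng.random() < epsilon:
                action = rng.randrange(2)
            else:
                best = max(q[state])
                action = rng.choice([a for a in range(2) if q[state][a] == best])
            next_state, reward, done = step(state, action)
            real_steps += 1
            update(state, action, reward, next_state, done)        # direct RL
            model[(state, action)] = (reward, next_state, done)    # model learning
            for _ in range(planning_steps):                        # planning
                (s, a), (r, s2, d) = rng.choice(list(model.items()))
                update(s, a, r, s2, d)
            state = next_state
    return q, real_steps


if __name__ == "__main__":
    for n in (0, 10):
        _, steps = dyna_q(planning_steps=n)
        print(f"planning_steps={n}: {steps} real environment steps")
```

For the evaluation step, one option is to compare the number of real environment steps used for different values of `planning_steps`, averaged over several seeds, since a single run can be noisy.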

Career center

Learners who complete Sample-based Learning Methods will develop knowledge and skills that may be useful to these careers:
Machine Learning Engineer
Machine Learning Engineers are responsible for designing, developing, and deploying machine learning models for a wide range of applications. These models can be used to automate tasks, improve decision-making, and gain insights from data. The University of Alberta's Sample-based Learning Methods course can provide you with the skills you need to succeed in this role by teaching you about the fundamental concepts of machine learning, including temporal difference learning, Monte Carlo methods, and model-based reinforcement learning. This knowledge will help you to develop and deploy machine learning models that are efficient, accurate, and robust.
Data Scientist
Data Scientists use their knowledge of statistics, machine learning, and data analysis to extract insights from data. They work in a variety of industries, including finance, healthcare, and retail. The University of Alberta's Sample-based Learning Methods course can provide you with the skills you need to succeed in this role by teaching you about the fundamental concepts of machine learning, including temporal difference learning, Monte Carlo methods, and model-based reinforcement learning. This knowledge will help you to develop and deploy machine learning models that can be used to solve complex problems and make better decisions.
Software Engineer
Software Engineers design, develop, and maintain software applications. They work in a variety of industries, including finance, healthcare, and technology. The University of Alberta's Sample-based Learning Methods course can provide you with the skills you need to succeed in this role by teaching you about the fundamental concepts of machine learning, including temporal difference learning, Monte Carlo methods, and model-based reinforcement learning. This knowledge will help you to develop software applications that are efficient, reliable, and user-friendly.
Quantitative Analyst
Quantitative Analysts use mathematical and statistical models to analyze financial data and make investment decisions. They work in a variety of financial institutions, including banks, hedge funds, and asset management companies. The University of Alberta's Sample-based Learning Methods course can provide you with the skills you need to succeed in this role by teaching you about the fundamental concepts of machine learning, including temporal difference learning, Monte Carlo methods, and model-based reinforcement learning. This knowledge will help you to develop and deploy machine learning models that can be used to identify investment opportunities and make better decisions.
Operations Research Analyst
Operations Research Analysts use mathematical and statistical models to solve complex problems in a variety of industries, including logistics, manufacturing, and healthcare. The University of Alberta's Sample-based Learning Methods course can provide you with the skills you need to succeed in this role by teaching you about the fundamental concepts of machine learning, including temporal difference learning, Monte Carlo methods, and model-based reinforcement learning. This knowledge will help you to develop and deploy machine learning models that can be used to improve efficiency, reduce costs, and make better decisions.
Business Analyst
Business Analysts use their knowledge of business processes and data analysis to help organizations improve their performance. They work in a variety of industries, including finance, healthcare, and manufacturing. The University of Alberta's Sample-based Learning Methods course can provide you with the skills you need to succeed in this role by teaching you about the fundamental concepts of machine learning, including temporal difference learning, Monte Carlo methods, and model-based reinforcement learning. This knowledge will help you to develop and deploy machine learning models that can be used to improve efficiency, reduce costs, and make better decisions.
Data Engineer
Data Engineers design, build, and maintain data pipelines that collect, store, and process data. They work in a variety of industries, including finance, healthcare, and technology. The University of Alberta's Sample-based Learning Methods course can provide you with the skills you need to succeed in this role by teaching you about the fundamental concepts of machine learning, including temporal difference learning, Monte Carlo methods, and model-based reinforcement learning. This knowledge will help you to develop and deploy machine learning models that can be used to improve the efficiency and accuracy of data pipelines.
Product Manager
Product Managers are responsible for the development and launch of new products and features. They work in a variety of industries, including technology, finance, and healthcare. The University of Alberta's Sample-based Learning Methods course can provide you with the skills you need to succeed in this role by teaching you about the fundamental concepts of machine learning, including temporal difference learning, Monte Carlo methods, and model-based reinforcement learning. This knowledge will help you to develop and launch products that meet the needs of users and achieve business goals.
Risk Analyst
Risk Analysts identify, assess, and mitigate risks for organizations. They work in a variety of industries, including finance, insurance, and healthcare. The University of Alberta's Sample-based Learning Methods course can provide you with the skills you need to succeed in this role by teaching you about the fundamental concepts of machine learning, including temporal difference learning, Monte Carlo methods, and model-based reinforcement learning. This knowledge will help you to develop and deploy machine learning models that can be used to identify and mitigate risks.
Consultant
Consultants provide advice and guidance to organizations on a variety of topics, including business strategy, operations, and technology. The University of Alberta's Sample-based Learning Methods course can provide you with the skills you need to succeed in this role by teaching you about the fundamental concepts of machine learning, including temporal difference learning, Monte Carlo methods, and model-based reinforcement learning. This knowledge will help you to develop and deploy machine learning models that can be used to solve complex problems and make better decisions.
Researcher
Researchers conduct original research in a variety of fields, including science, engineering, and medicine. The University of Alberta's Sample-based Learning Methods course can provide you with the skills you need to succeed in this role by teaching you about the fundamental concepts of machine learning, including temporal difference learning, Monte Carlo methods, and model-based reinforcement learning. This knowledge will help you to develop and deploy machine learning models that can be used to solve complex problems and make new discoveries.
Teacher
Teachers educate students in a variety of subjects, including science, math, and social studies. The University of Alberta's Sample-based Learning Methods course can provide you with the skills you need to succeed in this role by teaching you about the fundamental concepts of machine learning, including temporal difference learning, Monte Carlo methods, and model-based reinforcement learning. This knowledge will help you to develop and deliver lesson plans that are engaging and effective.
Policy Analyst
Policy Analysts develop and evaluate policies for governments and organizations. The University of Alberta's Sample-based Learning Methods course can provide you with the skills you need to succeed in this role by teaching you about the fundamental concepts of machine learning, including temporal difference learning, Monte Carlo methods, and model-based reinforcement learning. This knowledge will help you to develop and evaluate policies that are effective and evidence-based.
Financial Analyst
Financial Analysts analyze financial data to make investment recommendations. They work in a variety of financial institutions, including banks, hedge funds, and asset management companies. The University of Alberta's Sample-based Learning Methods course can provide you with the skills you need to succeed in this role by teaching you about the fundamental concepts of machine learning, including temporal difference learning, Monte Carlo methods, and model-based reinforcement learning. This knowledge will help you to develop and deploy machine learning models that can be used to identify investment opportunities and make better decisions.
Actuary
Actuaries use mathematical and statistical models to assess and manage risk for insurance companies and other financial institutions. The University of Alberta's Sample-based Learning Methods course can provide you with the skills you need to succeed in this role by teaching you about the fundamental concepts of machine learning, including temporal difference learning, Monte Carlo methods, and model-based reinforcement learning. This knowledge will help you to develop and deploy machine learning models that can be used to assess and manage risk more effectively.

Reading list

We've selected nine books that we think will supplement your learning. Use these to develop background knowledge, enrich your coursework, and gain a deeper understanding of the topics covered in Sample-based Learning Methods.
Provides a comprehensive overview of deep reinforcement learning, covering both the theoretical foundations and practical applications. It is a valuable resource for anyone interested in learning more about this field.
Provides a comprehensive overview of reinforcement learning, covering both the theoretical foundations and practical applications. It is a valuable resource for anyone interested in learning more about this field.
This textbook provides a comprehensive overview of machine learning, covering both the theoretical foundations and practical applications. It is a valuable resource for anyone interested in learning more about this field.
This textbook provides a comprehensive overview of deep learning, covering both the theoretical foundations and practical applications. It is a valuable resource for anyone interested in learning more about this field.
This textbook provides a comprehensive overview of probabilistic graphical models, which are a powerful tool for representing and reasoning about uncertainty. It is a valuable resource for anyone interested in learning more about this field.
This textbook provides a comprehensive overview of information theory, inference, and learning algorithms. It is a valuable resource for anyone interested in learning more about these fields.
This textbook provides a comprehensive overview of Bayesian reasoning and machine learning. It is a valuable resource for anyone interested in learning more about these fields.
Provides a comprehensive overview of Markov decision processes, which are a fundamental mathematical framework for reinforcement learning. It is a valuable resource for anyone interested in understanding the theoretical foundations of reinforcement learning.
Provides a gentle introduction to machine learning, making it a good choice for those who are new to the field. It covers a wide range of topics, including reinforcement learning.


© 2016 - 2025 OpenCourser