We may earn an affiliate commission when you visit our partners.

SARSA

Save
May 11, 2024 3 minute read

SARSA (State-Action-Reward-State-Action) is an on-policy temporal difference learning algorithm used in reinforcement learning to estimate the value of a state-action pair. It is an extension of the Q-learning algorithm and was developed by Rummery and Niranjan in 1994. SARSA is an iterative algorithm that learns the value of a state-action pair by repeatedly taking an action in a state and observing the resulting reward and next state. The value of the state-action pair is then updated based on the observed reward and the estimated value of the next state-action pair.

How SARSA Works

SARSA works by maintaining a value function that estimates the value of each state-action pair. The value function is updated after each action is taken, based on the observed reward and the estimated value of the next state-action pair. The update rule for the value function is given by the following equation:

Q(s, a) <= Q(s, a) + α * (r + γ * Q(s', a') - Q(s, a))

where:

Share

Help others find this page about SARSA: by sharing it with your friends and followers:

Reading list

We've selected six books that we think will supplement your learning. Use these to develop background knowledge, enrich your coursework, and gain a deeper understanding of the topics covered in SARSA.
Provides a comprehensive overview of reinforcement learning with function approximation, including a chapter on SARSA. It is written by two of the leading researchers in the field and valuable resource for anyone interested in this topic.
Provides a comprehensive overview of multi-agent reinforcement learning, including a chapter on SARSA. It is written by two of the leading researchers in the field and valuable resource for anyone interested in this topic.
Provides a comprehensive overview of reinforcement learning in games, including a chapter on SARSA. It is written by two of the leading researchers in the field and valuable resource for anyone interested in this topic.
Table of Contents
Our mission

OpenCourser helps millions of learners each year. People visit us to learn workspace skills, ace their exams, and nurture their curiosity.

Our extensive catalog contains over 50,000 courses and twice as many books. Browse by search, by topic, or even by career interests. We'll match you to the right resources quickly.

Find this site helpful? Tell a friend about us.

Affiliate disclosure

We're supported by our community of learners. When you purchase or subscribe to courses and programs or purchase books, we may earn a commission from our partners.

Your purchases help us maintain our catalog and keep our servers humming without ads.

Thank you for supporting OpenCourser.

© 2016 - 2025 OpenCourser