We may earn an affiliate commission when you visit our partners.

Gradient Descent

Save
May 1, 2024 Updated May 11, 2025 29 minute read

Gradient Descent is an optimization algorithm at the core of many machine learning models. Imagine trying to find the lowest point in a foggy valley. You'd likely take a step in the direction that seems to go down the most, then re-evaluate and take another step, continuing until you can't go any lower. Gradient Descent works similarly, iteratively adjusting parameters to minimize a function, typically a "loss" or "cost" function that measures how far off a model's predictions are from the actual values. This process is fundamental to training models that can learn from data and make accurate predictions or decisions.

Working with Gradient Descent can be engaging for several reasons. Firstly, it's a foundational concept that unlocks the inner workings of many powerful AI technologies. Understanding how models "learn" through optimization is intellectually stimulating. Secondly, the ability to fine-tune and experiment with different aspects of Gradient Descent, such as learning rates and batch sizes, to improve model performance can be a rewarding challenge. Finally, seeing a model you've trained using Gradient Descent make accurate predictions on new, unseen data provides a tangible sense of accomplishment and demonstrates the practical power of this algorithm.

What is Gradient Descent?

Definition and Basic Analogy

At its heart, Gradient Descent is an iterative optimization algorithm used to find the minimum of a function. Think of it like a hiker trying to get to the bottom of a valley in the dark. The hiker can only feel the slope of the ground beneath their feet. To descend, they will take a step in the direction where the slope is steepest downwards. After each step, they re-evaluate the slope and take another step, continuing this process until they reach a point where all directions lead upwards, indicating they are at a local minimum (hopefully the lowest point in the valley). In mathematical terms, this "slope" is the gradient of the function, and each "step" is an adjustment to the model's parameters.

Path to Gradient Descent

Take the first step.
We've curated 24 courses to help you on your path to Gradient Descent. Use these to develop your skills, build background knowledge, and put what you learn to practice.
Sorted from most relevant to least relevant:

Share

Help others find this page about Gradient Descent: by sharing it with your friends and followers:

Reading list

We've selected three books that we think will supplement your learning. Use these to develop background knowledge, enrich your coursework, and gain a deeper understanding of the topics covered in Gradient Descent.
Provides a comprehensive treatment of convex optimization, which powerful technique for solving a wide range of optimization problems. Gradient descent key algorithm for solving convex optimization problems.
Provides a probabilistic perspective on machine learning, which is essential for understanding the theoretical foundations of gradient descent. It covers a wide range of machine learning topics, including supervised learning, unsupervised learning, and reinforcement learning.
Provides a comprehensive overview of deep learning, which powerful technique for machine learning. Gradient descent key algorithm for training deep learning models.
Table of Contents
Our mission

OpenCourser helps millions of learners each year. People visit us to learn workspace skills, ace their exams, and nurture their curiosity.

Our extensive catalog contains over 50,000 courses and twice as many books. Browse by search, by topic, or even by career interests. We'll match you to the right resources quickly.

Find this site helpful? Tell a friend about us.

Affiliate disclosure

We're supported by our community of learners. When you purchase or subscribe to courses and programs or purchase books, we may earn a commission from our partners.

Your purchases help us maintain our catalog and keep our servers humming without ads.

Thank you for supporting OpenCourser.

© 2016 - 2025 OpenCourser