We may earn an affiliate commission when you visit our partners.
Course image
Course image
Coursera logo

Introduction to Computer Vision

Tom Yeh

Introduction to Computer Vision guides learners through the essential algorithms and methods to help computers 'see' and interpret visual data. You will first learn the core concepts and techniques that have been traditionally used to analyze images. Then, you will learn modern deep learning methods, such as neural networks and specific models designed for image recognition, and how it can be used to perform more complex tasks like object detection and image segmentation. Additionally, you will learn the creation and impact of AI-generated images and videos, exploring the ethical considerations of such technology.

Read more

Introduction to Computer Vision guides learners through the essential algorithms and methods to help computers 'see' and interpret visual data. You will first learn the core concepts and techniques that have been traditionally used to analyze images. Then, you will learn modern deep learning methods, such as neural networks and specific models designed for image recognition, and how it can be used to perform more complex tasks like object detection and image segmentation. Additionally, you will learn the creation and impact of AI-generated images and videos, exploring the ethical considerations of such technology.

This course can be taken for academic credit as part of CU Boulder’s MS in Computer Science degree offered on the Coursera platform. These fully accredited graduate degrees offer targeted courses, short 8-week sessions, and pay-as-you-go tuition. Admission is based on performance in three preliminary courses, not academic history. CU degrees on Coursera are ideal for recent graduates or working professionals. Learn more: https://coursera.org/degrees/ms-computer-science-boulder.

Enroll now

What's inside

Syllabus

Week 1
This module introduces foundational concepts related to common image types and functions. It offers a comprehensive overview of different formats and their unique characteristics. This section establishes the context for understanding how images are represented and processed in various applications. Next, the module delves into image functions, explaining the basic operations that can be performed on images to enhance or manipulate them, such as cropping, resizing, or adjusting brightness. It also covers more advanced operations like filtering and thresholding, illustrating how these functions play a crucial role in image processing. Then the module explores the underlying mathematics of image transformations. It starts with linear transforms, highlighting their application in image scaling, rotation, and translation. The module then introduces homogeneous coordinates, providing a simplified approach to represent complex transformations with additional dimensions. This leads into a deeper exploration of homogeneous transformations, demonstrating how they are used to perform multiple transformations in a single step.
Read more
Week 2
This module provides a deep dive into image analysis and similarity assessment techniques. It starts by exploring the basic concept of comparing pixels, highlighting how individual pixel values can be used to gauge similarity. This is followed by a detailed discussion on comparing multiple images by their features, emphasizing the advantages of feature-based analysis over pixel-by-pixel comparison. The module introduces the concept of image moments, revealing how these statistical properties help identify shapes and patterns within images. The module then addresses similarity and distance, offering a quick overview of how these concepts are calculated and applied in image processing. You'll also learn about converting pixels into distributions, an essential technique for more complex analysis. This leads to a comprehensive explanation of cross-entropy, providing insights into its role in measuring the dissimilarity between distributions. You'll explore cross-correlation in 1D, followed by a deeper examination of cross-correlation as matrix multiplication. The module wraps up by exploring cross-correlation in more detail, with a focus on the mathematics behind it.
Week 3
This module delves into multiview geometry, a pivotal concept in computer vision and 3D modeling. It starts with a brief overview of the motivation behind multiview systems, highlighting the advantages of capturing scenes from multiple viewpoints. The module then discusses multiple coordinate systems, exploring how different reference frames can describe points and transformations in 3D space. You'll also learn about multiple viewing planes, which play a crucial role in multiview setups by providing unique perspectives for scene reconstruction. The focus shifts to multiview projection, examining how distinct images from multiple cameras can be used to create a cohesive 3D scene. You'll gain insights into the principles of translation and rotation in 3D, crucial for understanding camera movement and orientation. The module also covers camera translation and camera rotation, offering practical examples to illustrate how camera motion affects the geometry and visual representation of a scene.
Week 4
This module delves into key concepts of camera models and their role in computer vision and photogrammetry. Learn about the Extrinsic Matrix, exploring how it defines the position and orientation of a camera in 3D space. Understand the Pinhole Camera Model, a simplified optical system that forms the basis for many computer vision applications, alongside the Intrinsic Matrix, which captures the internal parameters of the camera. Epipolar geometry is examined, with a focus on its significance in 3D reconstruction and stereo vision. The module covers the motivation behind epipolar geometry, breaking down its basic components, and explaining the Essential Matrix, which encapsulates the geometric relationship between camera views, as well as the Fundamental Matrix, a core component in epipolar geometry that represents the relationship between two cameras in stereo vision.

Good to know

Know what's good
, what to watch for
, and possible dealbreakers
Prerequisites are clearly outlined
Builds a strong foundation for beginners
Develops professional skills
Taught by instructors recognized for their work in computer vision
Course content is relevant to industry trends
Emphasizes hands-on learning

Save this course

Save Introduction to Computer Vision to your list so you can find it easily later:
Save

Activities

Be better prepared before your course. Deepen your understanding during and after it. Supplement your coursework and achieve mastery of the topics covered in Introduction to Computer Vision with these activities:
Read Computer Vision: Algorithms and Applications
Enhance your theoretical foundation by reading a comprehensive book on computer vision, gaining a deeper understanding of the field's core algorithms and applications.
View Computer Vision on Amazon
Show steps
  • Obtain a copy of the book.
  • Read the book in its entirety.
  • Take notes and highlight important concepts.
Create a Tutorial on Image Filtering
Solidify your understanding of image filtering by creating a tutorial that explains the concepts and techniques, deepening your knowledge and helping others learn.
Browse courses on Image Filtering
Show steps
  • Review the concepts and techniques of image filtering.
  • Choose a specific type of image filter to focus on.
  • Write a tutorial explaining the filter, including its purpose, how it works, and how to implement it.
  • Share your tutorial with others.
Attend a Computer Vision Hackathon
Immerse yourself in a hands-on environment at a computer vision hackathon, where you can collaborate with others and push your skills to the next level.
Show steps
  • Find a computer vision hackathon in your area.
  • Register for the hackathon.
  • Prepare for the hackathon by reviewing course materials.
  • Attend the hackathon and collaborate with other participants.
  • Develop a computer vision project during the hackathon.
One other activity
Expand to see all activities and additional details
Show all four activities
Develop a Computer Vision Model for a Specific Application
Apply your knowledge by developing a computer vision model for a specific application, honing your skills in model design, training, and evaluation.
Show steps
  • Choose a specific computer vision application.
  • Design a computer vision model for the application.
  • Train the model using a suitable dataset.
  • Evaluate the performance of the model.

Career center

Learners who complete Introduction to Computer Vision will develop knowledge and skills that may be useful to these careers:

Reading list

We haven't picked any books for this reading list yet.

Share

Help others find this course page by sharing it with your friends and followers:

Similar courses

Here are nine courses similar to Introduction to Computer Vision.
Our mission

OpenCourser helps millions of learners each year. People visit us to learn workspace skills, ace their exams, and nurture their curiosity.

Our extensive catalog contains over 50,000 courses and twice as many books. Browse by search, by topic, or even by career interests. We'll match you to the right resources quickly.

Find this site helpful? Tell a friend about us.

Affiliate disclosure

We're supported by our community of learners. When you purchase or subscribe to courses and programs or purchase books, we may earn a commission from our partners.

Your purchases help us maintain our catalog and keep our servers humming without ads.

Thank you for supporting OpenCourser.

© 2016 - 2024 OpenCourser