We may earn an affiliate commission when you visit our partners.

Course image

Computer Vision and Generative AI

Brian Cruz, Emily McMilin, Victor Geislinger, Jason Lin, Erick Galinkin, Giacomo Vianello, Chuyi Shang, Annabel Ng, Derek Xu, Nathaniel Haynam, Valerie Scarlata, Chang She, and Sergei Kozyrenko

Brian Cruz, Emily McMilin, Victor Geislinger, Jason Lin, Erick Galinkin, Giacomo Vianello, Chuyi Shang, Annabel Ng, Derek Xu, Nathaniel Haynam, Valerie Scarlata, Chang She, and Sergei Kozyrenko

Learn cutting edge AI skills and unlock AI's visual power with Udacity's Computer Vision & Generative AI Training Course. Enroll to become an AI innovator!

Prerequisite details

To optimize your success in this program, we've created a list of prerequisites and recommendations to help you prepare for the curriculum. Prior to enrolling, you should have the following knowledge:

Transformer neural networks
Hugging Face
Deep learning
Prompt Engineering
PyTorch
Foundation Models
Generative AI Fluency
Intermediate Python

Read more

Learn cutting edge AI skills and unlock AI's visual power with Udacity's Computer Vision & Generative AI Training Course. Enroll to become an AI innovator!

Prerequisite details

To optimize your success in this program, we've created a list of prerequisites and recommendations to help you prepare for the curriculum. Prior to enrolling, you should have the following knowledge:

Transformer neural networks
Hugging Face
Deep learning
Prompt Engineering
PyTorch
Foundation Models
Generative AI Fluency
Intermediate Python

You will also need to be able to communicate fluently and professionally in written and spoken English.

Here's a deal for you

We found an offer that may be relevant to this course.

Save money when you learn. All coupon codes, vouchers, and discounts are applied automatically unless otherwise noted.

Use code at checkout. Valid for a limited time only

Bootcamp level quality

Complete hands-on projects and get personalized feedback with Udacity.

What's inside

Syllabus

In this lesson, you will define image generation and understand its relevance in AI and machine learning.

Learn how computers see images and perform key image processing techniques using classic image processing techniques such as image transformation, noise reduction, and more.

Read more

Explore the landscape of Gen AI tools for Computer Vision and learn how they are evaluated. Learn what a generative adversarial network is and how it is utilized to generate images.

In this lesson, we will be exploring Vision Transformers and the architecture that makes them work. Along the way we will explore Vision Transformers like DALL-E, DINO, and SAM.

Learn the fundamentals of transformers. Then, get hands-on with the creation of a diffusion algorithm and work with Huggingface Diffusers to generate and work with images.

In this project, you will utilize Generative AI to take a famous painting and swap out the background with an image generated by Stable Diffusion.

Good to know

Know what's good

, what to watch for

, and possible dealbreakers

Taught by recognized experts in the field of Computer Vision and Generative AI, including Brian Cruz, Emily McMilin, Victor Geislinger, and more

Covers the latest advancements in image generation, generative adversarial networks, and diffusion algorithms

Provides hands-on experience with Hugging Face Diffusers, a state-of-the-art library for image generation

Develops a strong foundation in Generative AI, which is increasingly used in various industries

Requires prerequisites in deep learning, transformer neural networks, and prompt engineering

Assumes some background knowledge in image processing techniques

Save this course

Save Computer Vision and Generative AI to your list so you can find it easily later:

Save

Activities

Be better prepared before your course. Deepen your understanding during and after it. Supplement your coursework and achieve mastery of the topics covered in Computer Vision and Generative AI with these activities:

Prompt Engineering Exercises

Show steps

Enhance your proficiency in prompt engineering, enabling you to craft effective prompts that yield exceptional image generation results.

Browse courses on Prompt Engineering

Show steps

Practice writing clear and concise prompts.
Experiment with different prompt elements.
Iterate and refine your prompts based on feedback.

Image Generation: Artistic Exploration

Show steps

Foster your creativity and explore the artistic possibilities of image generation, expanding your horizons and inspiring novel artistic approaches.

Browse courses on Image Generation

Show steps

Experiment with different prompts and generation settings.
Create a collection of visually striking images.
Share your creations and engage with the community.

Walkthrough: Vision Transformers for Image Generation

Show steps

Build a strong conceptual framework for vision transformers, empowering you to confidently navigate complex image generation tasks.

Show steps

Explore the fundamentals of Vision Transformers.
Delve into the architecture of DALL-E.
Practice implementing Vision Transformers for image generation.

One other activity

Expand to see all activities and additional details

Show all four activities

Diffusion Model Implementation

Show steps

Solidify your understanding of diffusion models by implementing your own from scratch, enhancing your ability to generate and manipulate images.

Browse courses on Diffusion Models

Show steps

Gain a theoretical understanding of diffusion models.
Implement a diffusion model using Hugging Face Diffusers.
Generate images using your implemented model.

Career center

Learners who complete Computer Vision and Generative AI will develop knowledge and skills that may be useful to these careers:

Generative AI Engineer

A Generative AI Engineer develops and deploys generative AI models. The Computer Vision and Generative AI course can provide a strong foundation for a career in Generative AI Engineering by teaching students how to use AI to generate images, text, and other types of data. This course covers techniques such as generative adversarial networks, transformers, and diffusion models, which are essential skills for Generative AI Engineers who work with visual data. Additionally, the course's focus on Computer Vision can help Generative AI Engineers develop new methods for generating and manipulating images.

See salaries and explore the career path for Generative AI Engineer

Computer Vision Engineer

A Computer Vision Engineer develops and deploys computer vision systems. The Computer Vision and Generative AI course can provide a strong foundation for a career in Computer Vision Engineering by teaching students how to use AI to process and analyze images. This course covers techniques such as image transformation, noise reduction, and segmentation, which are essential skills for Computer Vision Engineers who work with visual data. Additionally, the course's focus on Generative AI can help Computer Vision Engineers develop new methods for generating and manipulating images.

See salaries and explore the career path for Computer Vision Engineer

AI Researcher

An AI Researcher develops new AI algorithms and techniques. The Computer Vision and Generative AI course can provide a strong foundation for a career in AI Research by teaching students how to use AI to process and analyze images. This course covers techniques such as deep learning, reinforcement learning, and generative adversarial networks, which are essential skills for AI Researchers who work with visual data. Additionally, the course's focus on Generative AI can help AI Researchers develop new methods for generating and manipulating images.

See salaries and explore the career path for AI Researcher

Machine Learning Engineer

A Machine Learning Engineer designs, builds, and deploys machine learning models. The Computer Vision and Generative AI course can provide a strong foundation for a career in Machine Learning Engineering by teaching students how to use AI to create models that can recognize and generate images. This course covers techniques such as convolutional neural networks, generative adversarial networks, and transformers, which are essential skills for Machine Learning Engineers who work with visual data.

See salaries and explore the career path for Machine Learning Engineer

Robotics Engineer

A Robotics Engineer designs, builds, and deploys robots. The Computer Vision and Generative AI course can provide a strong foundation for a career in Robotics Engineering by teaching students how to use AI to develop robots that can see and interact with the world around them. This course covers techniques such as image transformation, noise reduction, and segmentation, which are essential skills for Robotics Engineers who work with visual data. Additionally, the course's focus on Generative AI can help Robotics Engineers develop new methods for generating and manipulating data to train robots.

See salaries and explore the career path for Robotics Engineer

Animator

An Animator creates animated content for a variety of purposes, such as film, television, and video games. The Computer Vision and Generative AI course can provide a strong foundation for a career in Animation by teaching students how to use AI to create realistic and engaging animations. This course covers techniques such as image transformation, noise reduction, and segmentation, which are essential skills for Animators who work with visual data. Additionally, the course's focus on Generative AI can help Animators develop new methods for generating and manipulating animations.

See salaries and explore the career path for Animator

Game Developer

A Game Developer designs and develops video games. The Computer Vision and Generative AI course can provide a strong foundation for a career in Game Development by teaching students how to use AI to create visually appealing and engaging games. This course covers techniques such as image transformation, noise reduction, and segmentation, which are essential skills for Game Developers who work with visual data. Additionally, the course's focus on Generative AI can help Game Developers develop new methods for generating and manipulating game content.

See salaries and explore the career path for Game Developer

Data Scientist

A Data Scientist analyzes data to extract meaningful insights and make predictions. The Computer Vision and Generative AI course can provide a strong foundation for a career in Data Science by teaching students how to use AI to process and analyze images. This course covers techniques such as image transformation, noise reduction, and segmentation, which are essential skills for Data Scientists who work with visual data. Additionally, the course's focus on Generative AI can help Data Scientists develop new methods for generating and manipulating data.

See salaries and explore the career path for Data Scientist

Data Analyst

A Data Analyst collects, analyzes, and interprets data to identify trends and patterns. The Computer Vision and Generative AI course can provide a strong foundation for a career in Data Analysis by teaching students how to use AI to process and analyze images. This course covers techniques such as image transformation, noise reduction, and segmentation, which are essential skills for Data Analysts who work with visual data. Additionally, the course's focus on Generative AI can help Data Analysts develop new methods for generating and manipulating images.

See salaries and explore the career path for Data Analyst

Graphic designer

A Graphic Designer creates visual content for a variety of purposes, such as marketing, advertising, and web design. The Computer Vision and Generative AI course can provide a strong foundation for a career in Graphic Design by teaching students how to use AI to create visually appealing and engaging content. This course covers techniques such as image transformation, noise reduction, and segmentation, which are essential skills for Graphic Designers who work with visual data. Additionally, the course's focus on Generative AI can help Graphic Designers develop new methods for generating and manipulating images.

See salaries and explore the career path for Graphic designer

Photographer

A Photographer captures images for a variety of purposes, such as art, journalism, and commercial photography. The Computer Vision and Generative AI course can provide a strong foundation for a career in Photography by teaching students how to use AI to enhance and manipulate images. This course covers techniques such as image transformation, noise reduction, and segmentation, which are essential skills for Photographers who work with visual data. Additionally, the course's focus on Generative AI can help Photographers develop new methods for generating and manipulating images.

See salaries and explore the career path for Photographer

Videographer

A Videographer captures and edits video footage for a variety of purposes, such as film, television, and commercials. The Computer Vision and Generative AI course can provide a strong foundation for a career in Videography by teaching students how to use AI to enhance and manipulate video footage. This course covers techniques such as image transformation, noise reduction, and segmentation, which are essential skills for Videographers who work with visual data. Additionally, the course's focus on Generative AI can help Videographers develop new methods for generating and manipulating video footage.

See salaries and explore the career path for Videographer

Software Engineer

A Software Engineer designs, builds, and deploys software applications. The Computer Vision and Generative AI course can provide a strong foundation for a career in Software Engineering by teaching students how to use AI to develop software applications that can process and analyze images. This course covers techniques such as image transformation, noise reduction, and segmentation, which are essential skills for Software Engineers who work with visual data. Additionally, the course's focus on Generative AI can help Software Engineers develop new methods for generating and manipulating images.

See salaries and explore the career path for Software Engineer

UX Designer

A UX Designer designs user interfaces for software products. The Computer Vision and Generative AI course can provide a strong foundation for a career in UX Design by teaching students how to use AI to design user interfaces that are visually appealing and easy to use. This course covers techniques such as image transformation, noise reduction, and segmentation, which are essential skills for UX Designers who work with visual data. Additionally, the course's focus on Generative AI can help UX Designers develop new methods for generating and manipulating images.

See salaries and explore the career path for UX Designer

Product Manager

A Product Manager develops and manages software products. The Computer Vision and Generative AI course can provide a strong foundation for a career in Product Management by teaching students how to use AI to develop software products that can process and analyze images. This course covers techniques such as image transformation, noise reduction, and segmentation, which are essential skills for Product Managers who work with visual data. Additionally, the course's focus on Generative AI can help Product Managers develop new methods for generating and manipulating images.

See salaries and explore the career path for Product Manager

Reading list

We've selected seven books that we think will supplement your learning. Use these to develop background knowledge, enrich your coursework, and gain a deeper understanding of the topics covered in Computer Vision and Generative AI.

Cover image

Cover image

Save

Seminal work on GANs and provides a deep dive into the theory and practice of these models.

Deep Learning (Adaptive Computation and Machine...

Deep Learning (Adaptive Computation and Machine...

Cover image

Cover image

Four Laws for the Artificially Intelligent

Save

This research paper introduces GANs and explores their applications in image generation.

Four Laws for the Artificially Intelligent

Four Laws for the Artificially Intelligent

Cover image

Cover image

Concurrent NumPy in Python

Save

Provides a practical introduction to deep learning for computer vision, covering topics such as convolutional neural networks, object detection, and image segmentation.

Concurrent NumPy in Python: Faster NumPy With BLAS,...

Concurrent NumPy in Python: Faster NumPy With BLAS,...

Cover image

Cover image

Deep Learning with Python, Second Edition

Save

Great resource for deep learning fundamentals and provides a solid basis for understanding the concepts covered in the Computer Vision and Generative AI course.

Deep Learning with Python, Second Edition

Deep Learning with Python, Second Edition

Cover image

Cover image

Application of Chaos and Fractals to Computer Vision

Save

Provides a comprehensive overview of computer vision algorithms and applications, covering topics such as image formation, feature detection, object recognition, and image segmentation.

Application of Chaos and Fractals to Computer Vision

Application of Chaos and Fractals to Computer Vision

Cover image

Cover image

Probability and Statistics for Computer Science

Save

Provides a comprehensive and up-to-date overview of computer vision, covering topics such as image formation, feature detection, object recognition, and image segmentation.

Probability and Statistics for Computer Science

Probability and Statistics for Computer Science

Cover image

Cover image

Computer Vision

Save

Provides a comprehensive overview of computer vision algorithms and applications, covering topics such as image formation, feature detection, object recognition, and image segmentation.

Computer Vision

Computer Vision

Share

Help others find this course page by sharing it with your friends and followers:

Copy Link

Similar courses

Similar courses are unavailable at this time. Please try again later.

Course image

Effort

4 weeks

Level

Intermediate

Via

Udacity

Instructors

Brian Cruz

Emily McMilin

Victor Geislinger

Jason Lin

Erick Galinkin

Giacomo Vianello

Chuyi Shang

Annabel Ng

Derek Xu

Nathaniel Haynam

Valerie Scarlata

Chang She

Sergei Kozyrenko

Language

English

Good to know

Taught by recognized experts in the field of Computer Vision and Generative AI, including Brian Cruz, Emily McMilin, Victor Geislinger, and more

Covers the latest advancements in image generation, generative adversarial networks, and diffusion algorithms

Provides hands-on experience with Hugging Face Diffusers, a state-of-the-art library for image generation

Develops a strong foundation in Generative AI, which is increasingly used in various industries

Requires prerequisites in deep learning, transformer neural networks, and prompt engineering

Assumes some background knowledge in image processing techniques

Share and help others discover this course.

Link

Don't miss out

Enroll today to gain access to this course

Enroll in this course

Our mission

OpenCourser helps millions of learners each year. People visit us to learn workspace skills, ace their exams, and nurture their curiosity.

Our extensive catalog contains over 50,000 courses and twice as many books. Browse by search, by topic, or even by career interests. We'll match you to the right resources quickly.

Find this site helpful? Tell a friend about us.

Affiliate disclosure

We're supported by our community of learners. When you purchase or subscribe to courses and programs or purchase books, we may earn a commission from our partners.

Your purchases help us maintain our catalog and keep our servers humming without ads.

Thank you for supporting OpenCourser.

© 2016 - 2025 OpenCourser