Deep Learning for Computer Vision

Enjoy unlimited growth with a year of Coursera Plus for $199 (regularly $399). Save now.

Deep Learning for Computer Vision

This course is part of Computer Vision Specialization

Instructor: Tom Yeh

Included with

Learn more

4 modules

Gain insight into a topic and learn the fundamentals.

Intermediate level

Recommended experience

1 week to complete

at 10 hours a week

Flexible schedule

Learn at your own pace

Build toward a degree

Learn more

4 modules

Gain insight into a topic and learn the fundamentals.

Intermediate level

Recommended experience

1 week to complete

at 10 hours a week

Flexible schedule

Learn at your own pace

Build toward a degree

Learn more

What you'll learn

Improve model performance and training stability using multilayer perceptrons (MLPs) and applying normalization techniques.
Implement autoencoders for unsupervised feature learning and design Generative Adversarial Networks (GANs) to generate synthetic images.
Train convolutional neural networks (CNNs) for image classification tasks, understanding how layers extract spatial features from visual data.
Apply advanced architectures like ResNet for deep image recognition and U-Net for image segmentation.

Skills you'll gain

Details to know

Shareable certificate

Add to your LinkedIn profile

See how employees at top companies are mastering in-demand skills

Learn more about Coursera for Business

logos of Petrobras, TATA, Danone, Capgemini, P&G and L'Oreal

Build your subject-matter expertise

This course is part of the Computer Vision Specialization

When you enroll in this course, you'll also be enrolled in this Specialization.

Learn new concepts from industry experts
Gain a foundational understanding of a subject or tool
Develop job-relevant skills with hands-on projects
Earn a shareable career certificate

There are 4 modules in this course

Unlock the power of deep learning to transform visual data into actionable insights. This hands-on course guides you through the foundational and advanced techniques that drive modern computer vision applications—from image classification to generative modeling.

You'll begin with the building blocks of deep learning - understanding how multilayer perceptrons (MLPs) work, and exploring normalization techniques that stabilize and accelerate training. You'll then dive into unsupervised learning with autoencoders and discover the magic behind Generative Adversarial Networks (GANs) that can create realistic images from noise. After, you'll master the architecture that revolutionized computer vision by learning how CNNs extract spatial hierarchies and patterns from images for tasks like object detection and recognition. Finally, you'll explore cutting-edge architectures. ResNet introduces residual learning for deeper networks, while U-Net powers precise image segmentation in medical imaging and beyond. Whether you're a data scientist, engineer, or AI enthusiast, this course equips you with the skills to build and deploy deep learning models for real-world vision tasks. With practical examples and guided learning, you'll gain both theoretical understanding and hands-on experience. This course can be taken for academic credit as part of CU Boulder’s MS in Data Science or MS in Computer Science degrees offered on the Coursera platform. These fully accredited graduate degrees offer targeted courses, short 8-week sessions, and pay-as-you-go tuition. Admission is based on performance in three preliminary courses, not academic history. CU degrees on Coursera are ideal for recent graduates or working professionals. Learn more: MS in Data Science: https://www.coursera.org/degrees/master-of-science-data-science-boulder MS in Computer Science: https://coursera.org/degrees/ms-computer-science-boulder

Welcome to Deep Learning for Computer Vision, the second course in the Computer Vision specialization. In this first module, you'll be introduced to the principles behind neural networks and their use in visual recognition tasks. You'll begin by learning the basic building blocks—neurons, weights, biases—and progress toward constructing simple multi-layer perceptrons. Then, you'll discover key activation concepts like batch processing and graph-matrix conversions. Finally, you will visualize neural networks with an emphasis on classification tasks.

What's included

19 videos8 readings7 assignments

19 videosTotal 110 minutes

Meet Your Instructor 2 minutes
Graph to Matrix7 minutes
Matrix to Graph10 minutes
Bias3 minutes
Batch3 minutes
ReLU and LeakyReLU6 minutes
Hidden Layer and Sigmoid6 minutes
ReLU vs. LeakyReLU vs. Sigmoid2 minutes
Visualize Neurons7 minutes
Visualization4 minutes
Hidden Layer8 minutes
Output Layer7 minutes
Equation3 minutes
Calculation4 minutes
pyTorch8 minutes
Classification5 minutes
Softmax4 minutes
Batch Normalization6 minutes
Layer Normalization5 minutes

8 readingsTotal 51 minutes

Earn Academic Credit for your Work!10 minutes
Course Support10 minutes
Inside the Course5 minutes
Assessment Expectations10 minutes
AI Citation and Acknowledgement10 minutes
Get the Workbook: Neural Network2 minutes
Get the Workbook: Multi-Layer Perceptron2 minutes
Get the Workbook: Normalization2 minutes

7 assignmentsTotal 125 minutes

Neural Network Part One15 minutes
Neural Network Part Two15 minutes
Multi-Layer Perceptron Part One15 minutes
Multi-Layer Perceptron Part Two15 minutes
Normalization15 minutes
AI Policy Quiz5 minutes
Neural Network, Multi-Layer Perceptron, and Normalization45 minutes

In this module, you’ll explore two powerful architectures in deep learning: autoencoders and generative adversarial networks (GANs). You’ll begin by learning how autoencoders compress and reconstruct data using encoder-decoder structures, and how reconstruction loss is minimized through backpropagation and gradient descent. You’ll then examine the role of loss functions and optimization techniques in training these models. In the second half of the module, you’ll dive into GANs, where a generator and discriminator compete to produce realistic synthetic data. You’ll study how adversarial training works, how binary cross-entropy loss is applied, and how GANs are used to model complex data distributions. By the end of this module, you’ll be able to implement and evaluate both autoencoders and GANs for representation learning and data generation.

What's included

13 videos2 readings5 assignments

13 videosTotal 109 minutes

Encoder/Decoder Example 110 minutes
Encoder/Decoder Example 26 minutes
Larger Encoder/Decoder Architecture4 minutes
Loss Function6 minutes
Loss Gradient2 minutes
Backpropagation14 minutes
Gradient Desent8 minutes
Tiny GAN9 minutes
Generator6 minutes
Discriminator8 minutes
Binary Cross Entropy Loss10 minutes
BCE Loss Gradient5 minutes
Adversarial Training13 minutes

2 readingsTotal 4 minutes

Get the Workbook: Auto Encoder2 minutes
Get the Workbook: GAN2 minutes

5 assignmentsTotal 90 minutes

Auto Encoder Part One15 minutes
Auto Encoder Part Two15 minutes
GAN Part One15 minutes
GAN Part Two15 minutes
Auto Encoder and GAN30 minutes

In this module, you’ll learn how convolutional neural networks extract features from images and perform classification. You’ll begin by building a tiny CNN by hand and in Excel, exploring convolution, max-pooling, and fully connected layers. Then, you’ll scale up to larger CNN architectures and examine how they process data through multiple convolution and pooling stages. You’ll also study how categorical cross-entropy loss and gradients are computed for training. Finally, you’ll walk through backpropagation across all CNN layers to understand how learning occurs.

What's included

16 videos1 reading5 assignments

16 videosTotal 72 minutes

Tiny CNN by Hand6 minutes
Tiny CNN - Excel Formulas2 minutes
Tiny CNN - Graphical Representation3 minutes
Tiny CNN - Maxpool4 minutes
Tiny CNN - Fully Connected4 minutes
Large CNN Overview1 minute
Large CNN - Conv 15 minutes
Large CNN - Maxpool 12 minutes
Large CNN - Conv and Maxpool 25 minutes
Categorical Cross Entropy Loss9 minutes
CNN Loss Gradient6 minutes
Backpropagation3 minutes
Backpropagation - Fully Connected Layer4 minutes
Backpropagation - Maxpool Layer4 minutes
Backpropagation - ReLU and Convolution Layer4 minutes
CNN Takeaways2 minutes

1 readingTotal 2 minutes

Get the Workbook: CNN2 minutes

5 assignmentsTotal 90 minutes

CNN Part One15 minutes
CNN Part Two15 minutes
CNN Part Three15 minutes
CNN Part Four15 minutes
Convolutional Neural Networks30 minutes

In this module, you’ll explore two influential deep learning architectures: ResNet and U-Net. You’ll begin by learning how ResNet uses skip connections and residual learning to enable the training of very deep networks, addressing challenges like vanishing and exploding gradients. You’ll examine how residual blocks preserve information and support higher-order logic across layers. Then, you’ll shift to U-Net, a powerful architecture for image segmentation, and study its encoder-decoder structure, skip connections, and upsampling techniques like transposed convolution. By the end of this module, you’ll understand how both architectures enhance learning efficiency and performance in complex vision tasks.

What's included

17 videos2 readings5 assignments

17 videosTotal 96 minutes

First-Order Logic7 minutes
Second-Order Logic4 minutes
Mixture of First and Second Order Logic5 minutes
Skip Connection3 minutes
Residual7 minutes
Deep6 minutes
Add & Norm4 minutes
Exploding / Vanishing Gradients5 minutes
U-Net Overview3 minutes
Concat4 minutes
Add Different Dimensions7 minutes
U-Net Encoder4 minutes
U-Net Decoder5 minutes
Parametric Upscaling 3 minutes
Transposed Convolution10 minutes
Conv U-Net Encoder5 minutes
Conv U-Net Decoder7 minutes

2 readingsTotal 4 minutes

Get the Workbook: ResNet2 minutes
Get the Workbook: U-Net2 minutes

5 assignmentsTotal 90 minutes

ResNet Part One15 minutes
ResNet Part Two15 minutes
U-Net Part One15 minutes
U-Net Part Two15 minutes
ResNet and U-Net30 minutes

Earn a career certificate

Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.

Build toward a degree

This course is part of the following degree program(s) offered by University of Colorado Boulder. If you are admitted and enroll, your completed coursework may count toward your degree learning and your progress can transfer with you.¹

Instructor

Tom Yeh

University of Colorado Boulder

4 Courses14,435 learners

Offered by

University of Colorado Boulder

Explore more from Algorithms

Status: Preview
University of Colorado Boulder
Deep Learning Applications for Computer Vision
Course
Status: Free Trial
MathWorks
Introduction to Deep Learning for Computer Vision
Course
Status: Free Trial
University of Colorado Boulder
Introduction to Deep Learning
Course
Status: Free Trial
University of Colorado Boulder
Modern AI Models for Vision and Multimodal Understanding
Course

Why people choose Coursera for their career

Felipe M.

Learner since 2018

"To be able to take courses at my own pace and rhythm has been an amazing experience. I can learn whenever it fits my schedule and mood."

Jennifer J.

Learner since 2020

"I directly applied the concepts and skills I learned from my courses to an exciting new project at work."

Larry W.

Learner since 2021

"When I need courses on topics that my university doesn't offer, Coursera is one of the best places to go."

Chaitanya A.

"Learning isn't just about being better at your job: it's so much more than that. Coursera allows me to learn without limits."

Frequently asked questions

To access the course materials, assignments and to earn a Certificate, you will need to purchase the Certificate experience when you enroll in a course. You can try a Free Trial instead, or apply for Financial Aid. The course may offer 'Full Course, No Certificate' instead. This option lets you see all course materials, submit required assessments, and get a final grade. This also means that you will not be able to purchase a Certificate experience.

When you enroll in the course, you get access to all of the courses in the Specialization, and you earn a certificate when you complete the work. Your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile.

Yes. In select learning programs, you can apply for financial aid or a scholarship if you can’t afford the enrollment fee. If fin aid or scholarship is available for your learning program selection, you’ll find a link to apply on the description page.