NPTEL: Deep Learning For Visual Computing

Deep learning is a genre of machine learning algorithms that attempt to solve tasks by learning abstraction in data following a stratified description paradigm using non-­linear transformation architectures. When put in simple terms, say you want to make the machine recognize Mr. X standing in front of Mt. E on an image;; this task is a stratified or hierarchical recognition task. At the base of the recognition pyramid would be features which can discriminate flats, lines, curves, sharp angles, color;; higher up will be kernels which use this information to discriminate body parts, trees, natural scenery, clouds, etc.;; higher up it will use this knowledge to recognize humans, animals, mountains, etc.;; and higher up it will learn to recognize Mr. X and Mt. E and finally the apex lexical synthesizer module would say that Mr. X is standing in front of Mt. E. Deep learning is all about how you make machines synthesize this hierarchical logic and also learn these representative features and kernels all by itself. It has been used to solve problems like handwritten character recognition, object and product recognition and localization, image captioning, generating synthetic images to self driving cars. This course would provide you insights to theory and coding practice of deep learning for visual computing through curated exercises with Python and PyTorch on current developments.


Week 1: Introduction to Visual Computing and Neural Networks

Week 2: Multilayer Perceptron to Deep Neural Networks with Autoencoders

Week 3: Autoencoders for Representation Learning and MLP Initialization

Week 4: Stacked, Sparse, Denoising Autoencoders and Ladder Training

Week 5: Cost functions, Learning Rate Dynamics and Optimization

Week 6: Introduction to Convolutional Neural Networks (CNN) and LeNet

Week 7: Convolutional Autoencoders and Deep CNN (AlexNet, VGGNet)

Week 8: Very Deep CNN for Classification (GoogLeNet, ResNet, DenseNet)

Week 9: Computational Complexity and Transfer Learning of a Network

Week 10:Object Localization (RCNN) and Semantic Segmentation

Week 11:Generative Models with Adversarial Learning

Week 12: Recurrent Neural Networks (RNN) for Video Classification

0 Student
Cost Free Online Course
Pace Upcoming
Subject Deep Learning
Provider NPTEL
Language English
Calendar 12 weeks long

Disclosure: To support our site, Class Central may be compensated by some course providers.

+ Add to My Courses
FAQ View All
What are MOOCs?
MOOCs stand for Massive Open Online Courses. These are free online courses from universities around the world (eg. Stanford Harvard MIT) offered to anyone with an internet connection.
How do I register?
To register for a course, click on "Go to Class" button on the course page. This will take you to the providers website where you can register for the course.
How do these MOOCs or free online courses work?
MOOCs are designed for an online audience, teaching primarily through short (5-20 min.) pre recorded video lectures, that you watch on weekly schedule when convenient for you.  They also have student discussion forums, homework/assignments, and online quizzes or exams.

0 reviews for NPTEL's Deep Learning For Visual Computing

Write a review

Class Central

Get personalized course recommendations, track subjects and courses with reminders, and more.

Sign up for free