Course Details

Exam Registration258
Course StatusOngoing
Course TypeElective
LanguageEnglish
Duration12 weeks
CategoriesElectrical, Electronics and Communications Engineering
Credit Points3
LevelPostgraduate
Start Date19 Jan 2026
End Date10 Apr 2026
Enrollment Ends02 Feb 2026
Exam Registration Ends20 Feb 2026
Exam Date25 Apr 2026 IST
NCrF Level4.5 — 8.0

Master the Future of AI with Deep Learning for Visual Computing

In the rapidly evolving field of artificial intelligence, the ability to interpret and understand visual data is paramount. From medical diagnostics to autonomous vehicles, deep learning has revolutionized how machines perceive the world. If you're a postgraduate student or professional in Electrical, Electronics, or Computer Sciences looking to master these cutting-edge skills, a new course from the Indian Institute of Technology Kharagpur offers a definitive pathway.

About the Course Instructor: Prof. Debdoot Sheet

This 12-week intensive course is led by Prof. Debdoot Sheet, an esteemed Assistant Professor in the Department of Electrical Engineering at IIT Kharagpur. Prof. Sheet is also the founder of SkinCurate Research and brings a wealth of expertise to the classroom.

His academic credentials include MS and PhD degrees in computational medical imaging and machine learning from IIT Kharagpur, complemented by experience as a DAAD visiting PhD scholar at TU Munich. His research spans deep learning, domain adaptation, computational medical imaging, and surgical analytics. A prolific contributor to top-tier journals and conferences, Prof. Sheet is also an active member of IEEE, SPIE, ACM, and serves as an Editor for IEEE Pulse.

What is Deep Learning for Visual Computing?

Deep learning is a subset of machine learning that uses layered, hierarchical architectures to learn abstractions from data. In visual computing, this means teaching a machine to understand an image in strata—much like how humans do.

Consider recognizing a person standing before a mountain in a photo. A deep learning model doesn't see the scene holistically at first. Instead:

  • Lower layers identify basic features: edges, lines, colors, and curves.
  • Middle layers combine these to recognize shapes, textures, and object parts (like a limb or a tree).
  • Higher layers synthesize this information to identify complex objects (a human, a mountain).
  • The final layer makes the contextual judgment: "Mr. X is standing in front of Mt. E."

This course is designed to unravel this "hierarchical logic," teaching you not just the theory but also how machines can learn these representative features autonomously. You'll explore applications ranging from image captioning and synthetic image generation to the technology behind self-driving cars.

Course Details at a Glance

AspectDetail
Duration12 Weeks
LevelPostgraduate
Primary CategoriesElectrical, Electronics and Communications Engineering, Computer Sciences
PrerequisitesKnowledge of Digital Image Processing & Machine Learning
Tools & LanguagesPython, PyTorch
Industry SupportIntel, Microsoft, Google, NVIDIA, Philips, Siemens, Tesla, TCS, Infosys, and many more.

Weekly Course Layout: A Journey from Fundamentals to Advanced Models

The course is meticulously structured to build your expertise from the ground up over 12 weeks.

Weeks 1-4: Foundations of Neural Networks

  • Week 1: Introduction to Visual Computing and Neural Networks.
  • Week 2: Evolving from Multilayer Perceptrons to Deep Neural Networks with Autoencoders.
  • Weeks 3 & 4: Deep dive into Autoencoders for representation learning, initialization, and advanced variants like Stacked, Sparse, and Denoising Autoencoders.

Weeks 5-9: Mastering Convolutional Neural Networks (CNNs)

  • Week 5: Core concepts of Cost functions, Learning Rate Dynamics, and Optimization.
  • Week 6: Introduction to CNNs and the pioneering LeNet architecture.
  • Week 7: Convolutional Autoencoders and deeper architectures like AlexNet and VGGNet.
  • Week 8: Exploring very deep networks: GoogLeNet, ResNet, and DenseNet.
  • Week 9: Tackling Computational Complexity and the power of Transfer Learning.

Weeks 10-12: Advanced Applications and Models

  • Week 10: Moving beyond classification to Object Localization (R-CNN) and Semantic Segmentation.
  • Week 11: Unleashing creativity with Generative Models and Adversarial Learning (GANs).
  • Week 12: Handling sequential visual data with Recurrent Neural Networks (RNNs) for Video Classification.

Who Should Enroll?

This course is ideally suited for:

  • Postgraduate students in Electrical, Electronics, Computer Science, or related fields.
  • Professionals and researchers aiming to build or enhance their expertise in deep learning for computer vision.
  • Individuals with a solid foundation in Digital Image Processing and basic Machine Learning concepts.

Key Learning Outcomes and Resources

By the end of this course, you will gain both theoretical insights and hands-on coding practice with modern frameworks like PyTorch. The curated exercises are designed to solidify your understanding of current developments in the field.

The course references seminal texts to guide your learning:

  • Goodfellow, I., Bengio, Y., & Courville, A. (2016). Deep Learning. MIT Press.
  • Haykin, S. (2008). Neural Networks and Learning Machines (3rd ed.). Pearson.

With strong backing from leading global industries, the skills you acquire will be directly relevant to the job market, opening doors to roles in tech giants, healthcare imaging, automotive AI, and cutting-edge research.

Embark on this 12-week journey to demystify deep learning and become proficient in building intelligent systems that see and understand our visual world. Enroll today and transform your potential into expertise.

Enroll Now →

Explore More

Mock Test All Courses Start Learning Today