Carnegie Mellon University

Electrical and Computer Engineering

College of Engineering

Course Information

18-794: Introduction to Deep Learning and Pattern Recognition for Computer Vision




Introduction to Deep Learning and Pattern Recognition for Computer Vision will focus on Deep Learning algorithms used in Computer Vision applications while explaining the pattern recognition aspect of these algorithms. The first half of this course is also available as a mini (18-790) which introduces students to the basic Deep Learning ML techniques in the course. The course will first introduce Neural networks and how they perform recognition and their evolution to Deep Neural Networks, as well as different DNN backbone architectures (e.g. VGG, ResNet + variations, MobileNets, etc.) used for classification. We will overview DL architectures for object detection (to include a large range of algorithms such as anchor-based and anchor free, single stage, two-stage as well as well-known Yolo, SSD, FCOS, CornerNet, Mask-RCNN, DETR and others). We cover object recognition, semantic segmentation (with applications in robot vision, autonomous driving, general scene understanding, medical analysis), and other topics including instance segmentation, loss functions, feature extraction, Transformers, Generative Models, Neural Architecture Search (NAS), low form factor Deep Learning architectures for embedded platforms (e.g., Jetson Nano, AGX), TensorRT for model optimization on Nvidia embedded platforms, and ONNX model conversions.

Last Modified: 2024-06-28 12:44PM

Semesters offered:

  • Fall 2024
  • Fall 2023
  • Fall 2022
  • Fall 2021
  • Fall 2020
  • Fall 2019
  • Fall 2018
  • Fall 2017
  • Fall 2016
  • Fall 2015
  • Fall 2014
  • Fall 2013
  • Fall 2012
  • Fall 2011
  • Spring 2011
  • Spring 2010
  • Spring 2009
  • Spring 2008
  • Spring 2007
  • Spring 2005
  • Spring 2004
  • Spring 2003
  • Spring 2001
  • Spring 1999