Modern Computer Vision

From Image Processing to Deep Learning

Module 1

Introduction to Digital Image Processing

Overview of Digital Image Processing

Image Types & Representations

Image Formation in Cameras & Sensors

Stages of Image Processing

Image Compression & Encoding

Exercises

Module 2

Fundamentals of Computer Vision

Introduction to Computer Vision

Computer Vision Pipeline

Applications of Computer Vision

Challenges & Bias in AI Vision

Exercises

Module 3

Image Preprocessing & OpenCV

Introduction to OpenCV

Image Filtering & Morphological Operations

Color Space Conversions

Image Transformations (Geometric & Frequency Domain)

Contouring, Thresholding, and Edge Detection

Exercises

Module 4

Image Analysis & Classification

Image Classification Basics

Traditional Feature-Based Approaches (SIFT, SURF, ORB)

Modern Image Classification (CNN, ViT, CLIP)

Image Segmentation Techniques

Exercises

Module 5

Deep Learning for Computer Vision

Neural Networks & Backpropagation

Convolutional Neural Networks (CNNs)

Activation Functions & Optimization Methods

Data Augmentation & Transfer Learning

Self-Supervised Learning (DINO, BYOL, SimCLR)

Exercises

Module 6

Advanced CNN Architectures

LeNet, AlexNet, VGG

ResNet & EfficientNet

Inception & GoogLeNet

Vision Transformers (ViT, Swin Transformer)

CLIP: Multi-Modal Vision-Language Learning

Segment Anything Model (SAM) – New Era of Segmentation

Exercises

Module 7

Object Detection & Recognition

Object Detection Basics

R-CNN, Faster R-CNN, Mask R-CNN

YOLO Series (YOLOv7, YOLOv8, YOLO-NAS)

DETR: Transformer-Based Object Detection

Motion Analysis & Object Tracking (DeepSORT, ByteTrack)

Exercises

Module 8

Image Segmentation

Traditional Image Segmentation

Modern Deep Learning-Based Segmentation (U-Net, DeepLabv3+)

Instance & Panoptic Segmentation

Exercises

Module 9

Generative Models & AI Creativity

Generative Adversarial Networks (GANs)

Diffusion Models (Stable Diffusion 3, DALL-E 3)

Neural Radiance Fields (NeRF) – 3D Scene Reconstruction

Text-to-Image Models (Midjourney, DALL-E 3)

Face Generation & Deepfake Detection

Exercises

Module 10

Advanced AI & Real-World Applications

Face Recognition (DeepFace, ArcFace)

Real-Time Image Processing on Edge Devices

AI for Healthcare & Medical Imaging

AI in Autonomous Vehicles & Robotics

AI-Powered Video Analysis & Enhancement

Ethics & Privacy in Computer Vision

Exercises

Module 11

OpenCV & AI Projects

Real-Time Face Mask Detection

Object Detection with YOLOv8 & DeepSORT

Automatic Number Plate Recognition

Optical Flow & Motion Tracking

Image Super-Resolution using AI

QR Code Scanner with AI

Style Transfer using GANs

Custom Diffusion Model for Image Generation

3D Object Reconstruction with NeRF

AI-Powered Image Captioning (BLIP)

Colorization of Black & White Images

Edge AI: Running AI Models on Raspberry Pi