Course about deep learning for computer vision and graphics co-developed by YSDA and Skoltech.

Last update: Jan 02, 2023

Overview

Deep Vision and Graphics

This repo supplements the course "Deep Vision and Graphics", taught at YSDA in fall 2021. The course is the successor of the "Deep Learning" course taught at YSDA in 2015-2021. The new course focuses more on applications of deep learning to computer vision.

Lecture and seminar materials for each week are in the ./week* folders. Homework assignments are in the ./homework* folders.

General info

Telegram chat room (Russian).
YSDA deadlines and admin stuff can be found at the YSDA LMS (YSDA students only).
For technical issues, bugs in course materials, or contribution ideas, open an issue.

Syllabus

week01 Intro, recap of neural network basics, optimization, backprop, biological networks
week02 Images, linear filtering, convolutional networks, batchnorms, augmentations
week03 ConvNet architectures and how to find them, sparse convolutions in 3D, ConvNets for videos, transfer learning
week04 Dense prediction: semantic segmentation, super-resolution/image synthesis, perceptual losses
week05 Non-convolutional architectures: transformers (some recap of their use in NLP), mixers, FFT convolutions
week06 Visualizing and understanding deep architectures, adversarial examples
week07 Object detection, instance/panoptic segmentation, 2D/3D human pose estimation
week08 Representation learning: face recognition, verification tasks, self-supervised learning, image captioning
week09 Latent models (GLO, AEs, flow models, diffusion models, VQ-VAE, generative transformers, CLIP, DALL-E)
week10 Generative adversarial networks
week11 Shape and motion estimation: spatial transformers, optical flow, stereo, monodepth, point cloud generation, implicit and semi-implicit shape representations
week12 New view synthesis: multi-plane images, neural radiance fields, mesh-based and point-based representations for NVS, neural renderers

Contributors & course staff

Course materials and teaching by:

Victor Lempitsky - all main track lectures
Victor Yurchenko - seminars, homeworks, admin stuff
Fedor Ratnikov - seminars, homeworks, admin stuff
To be continued

Owner

Yandex School of Data Analysis
