National Institute of Technology Rourkela

राष्ट्रीय प्रौद्योगिकी संस्थान राउरकेला

ଜାତୀୟ ପ୍ରଯୁକ୍ତି ପ୍ରତିଷ୍ଠାନ ରାଉରକେଲା

An Institute of National Importance
NIT Rourkela Inside Page Banner

Syllabus

Course Details

Subject {L-T-P / C} : CS6510 : Deep Learning { 3-0-0 / 3}

Subject Nature : Theory

Coordinator : Sibarama Panigrahi

Syllabus

Module 1 :

History of Deep Learning, Deep Learning Success Stories, McCulloch Pitts Neuron, Thresholding Logic, Perceptrons, Perceptron Learning Algorithm and Convergence, Multilayer Perceptrons (MLPs), Representation Power of MLPs, Sigmoid Neurons, Gradient Descent, Feedforward Neural Networks, Representation Power of Feedforward Neural Networks, Feedforward Neural Networks, Backpropagation.

Module 2 :

Optimizers: Gradient Descent (GD), Momentum-Based GD, Nesterov Accelerated GD, Stochastic GD, AdaGrad, RMSProp, Adam, Learning Rate Scheduling.
Bias Variance Tradeoff, L2 regularization, Early stopping, Dataset augmentation, Parameter sharing and tying, Injecting noise at input, Ensemble methods, Dropout, Greedy Layerwise Pre-training, Activation Functions, Weight Initialization methods, Batch Normalization.

Module 3 :

Convolutional Neural Networks (CNNs): Convolution for Images, Padding and Strides, Pooling, LeNet, Modern CNNs: AlexNet, VGGNet, GoogLeNet, ResNet, DenseNet, Visualizing Convolutional Neural Networks, Guided Backpropagation, Deep Dream, Deep Art, Fooling Convolutional Neural Networks, Transfer Learning, Early fusion, Late fusion, Intermediate fusion, Learning Vectorial Representations of Words.
Hyperparameter optimization: Grid Search, Random Search, Bayesian Optimization.

Module 4 :

Recurrent Neural Networks(RNNs): Working with Sequences, Converting Raw Text to Sequence Data, Language Models, Backpropagation Through Time (BPTT), Vanishing and Exploding Gradients, Truncated BPTT, Long Short Term Memory (LSTM) Network, Gated Recurrent Units (GRUs), Deep RNNs, Solving the vanishing gradient problem with LSTMs, Bidirectional RNNs, Bidirectional Models (BiLSTM, BiGRU).
Encoder Decoder Models, Attention Mechanism, Attention over images, Hierarchical Attention, Attention Pooling by Similarity, Attention Scoring Functions, The Bahdanau Attention Mechanism, Multi-Head Attention, Self-Attention and Positional Encoding, The Transformer Architecture, Transformers for Vision, Large-Scale Pretraining with Transformers.

Course Objective

1 .

Understand the fundamentals of feedforward, recurrent and convolutional neural networks and apply them to solve various real-world problems.

2 .

Develop a deeper understanding in different optimization algorithms gradient descent, stochastic gradient descent and recent developments like ADAM and RMSprop.

3 .

Explore state-of-the-art architectures (CNN, LSTM, GRU, Bidirectional Models, and Transformer Models) and learn transfer learning with feature and decision level fusion.

4 .

Learn to improve the computational efficiency and performance of DL models for a specific problem by optimizing hyperparameters and employing GPUs

Course Outcome

1 .

Develop in depth understanding of the key deep learning models and concepts.

2 .

Gain practical skills and theoretical knowledge necessary to apply deep learning techniques to a wide range of problems.

3 .

Improve the performance (tuning hyperparameters) and computational efficiency (using GPUs) of deep learning models for real-world problems.

4 .

Determining an appropriate deep learning model for an application.

Essential Reading

1 .

A. Zhang, Z.C. Lipton, M. Li, A.J. Smola, Dive into Deep Learning, Cambridge University Press , Website: https://d2l.ai

2 .

Ian Goodfellow, Yoshua Bengio, Aaron Courville, Deep Learning, MIT Press , Website: https://www.deeplearningbook.org/

3 .

Charu C. Aggarwal , Neural Networks and Deep Learning, Springer

Supplementary Reading

1 .

Francois Chollet, Deep Learning with Python, Manning Publishers

2 .

E. Stevens, L. Antiga, T. Viehmann, Deep Learning with PyTorch, Manning Publishers

Journal and Conferences

1 .

IEEE Transactions on Neural Networks and Learning Systems

2 .

IEEE Transactions on Pattern Analysis and Machine Intelligence

3 .

AAAI Conference on Artificial Intelligence