National Institute of Technology Rourkela

राष्ट्रीय प्रौद्योगिकी संस्थान राउरकेला

ଜାତୀୟ ପ୍ରଯୁକ୍ତି ପ୍ରତିଷ୍ଠାନ ରାଉରକେଲା

An Institute of National Importance

Syllabus

Course Details

Subject {L-T-P / C} : CS6608 : Explainable AI { 3-0-0 / 3}

Subject Nature : Theory

Coordinator : Dr. Puneet Kumar Jain

Syllabus

UNIT I: Introduction to Explainable AI
Science of Interpretable Machine Learning, Motivation, Challenges, and Mythos of Model Interpretability, Human Factors in Explainability, Interpreting Interpretability

UNIT II: Post hoc Explanations
Explaining the Predictions of Any Classifier, Pitfalls, Challenges, and Evaluation of Feature Attributions, LIME and SHAP, OpenXAI, The Disagreement Problem in Explainable Machine Learning, Counterfactual Explanations (or) Algorithmic Recourse, Learning Model-Agnostic Counterfactual Explanations for Tabular Data

UNIT III: Attention and Concept Based Explanations
Quantifying Interpretability of Deep Visual Representations, Interpretability Beyond Feature Attribution, Data Attribution and Interactive Explanation, Equitable Valuation of Data, Explainable Active Learning (XAL), Theory of Explainability and Interpreting Generative Models

UNIT IV: Explainability for Fair Machine Learning
Connections with Robustness, Privacy, Fairness, and Unlearning, Right to Explanation and the Right to be Forgotten, Fairness via Explanation Quality, Mechanistic Interpretability and Compiled Transformers, Understanding and Reasoning in Large Language Models

Course Objectives

  • Define and explain the importance of explainability in artificial intelligence.
  • Introduce and analyze various techniques for enhancing the interpretability of machine learning models.
  • Cover methods such as feature importance analysis, rule-based models, and surrogate models.
  • Provide hands-on experience in applying explainability techniques to real-world datasets and problems.

Course Outcomes

The course aims to achieve the following outcomes: <br />CO1: Students will effectively apply interpretability techniques, such as feature importance analysis, rule-based models, and model-agnostic methods, to enhance the transparency of machine learning models. <br />CO2: Students will possess the ability to evaluate and compare different interpretability methods. <br />CO3: Students will develop a critical understanding of the trade-offs between model complexity and interpretability in diverse scenarios. <br />CO4: Demonstrating practical skills, students will apply explainability techniques to real-world datasets and challenges. <br />CO5: Students will gain awareness of the ethical implications related to AI transparency and interpretability.

Essential Reading

  • Michael Munn, David Pitman, Explainable AI for Practitioners, O'Reilly Media, Inc. , 2022
  • Leonida Gianfagna, Antonio Di Cecco, Explainable AI with Python, Springer , 2021

Supplementary Reading

  • Christoph Molnar, Interpretable Machine Learning, A Guide for Making Black Box Models Explainable, Leanpub , 2023
  • Denis Rothman, Hands-On Explainable AI (XAI) with Python: Interpret, visualize, explain, and integrate reliable AI for fair, secure, and trustworthy AI apps, Packt Publishing Limited , 2020

Journal and Conferences

  • Explainable Artificial Intelligence, ELSEVIER
  • AAAI Conference on Artificial Intelligence