NIT Rourkela

Course Details

Subject {L-T-P / C} : CS6608 : Explainable AI { 3-0-0 / 3}

Subject Nature : Theory

Coordinator : Dr. Puneet Kumar Jain

Syllabus

UNIT I: Introduction to Explainable AI
Science of Interpretable Machine Learning, Motivation, Challenges, and Mythos of Model Interpretability, Human Factors in Explainability, Interpreting Interpretability

UNIT II: Post hoc Explanations
Explaining the Predictions of Any Classifier, Pitfalls, Challenges, and Evaluation of Feature Attributions, LIME and SHAP, OpenXAI, The Disagreement Problem in Explainable Machine Learning, Counterfactual Explanations (or) Algorithmic Recourse, Learning Model-Agnostic Counterfactual Explanations for Tabular Data

UNIT III: Attention and Concept Based Explanations
Quantifying Interpretability of Deep Visual Representations, Interpretability Beyond Feature Attribution, Data Attribution and Interactive Explanation, Equitable Valuation of Data, Explainable Active Learning (XAL), Theory of Explainability and Interpreting Generative Models

UNIT IV: Explainability for Fair Machine Learning
Connections with Robustness, Privacy, Fairness, and Unlearning, Right to Explanation and the Right to be Forgotten, Fairness via Explanation Quality, Mechanistic Interpretability and Compiled Transformers, Understanding and Reasoning in Large Language Models

Course Objectives

Define and explain the importance of explainability in artificial intelligence.
Introduce and analyze various techniques for enhancing the interpretability of machine learning models.
Cover methods such as feature importance analysis, rule-based models, and surrogate models.
Provide hands-on experience in applying explainability techniques to real-world datasets and problems.

Course Outcomes

The course aims to achieve the following outcomes: CO1: Students will effectively apply interpretability techniques, such as feature importance analysis, rule-based models, and model-agnostic methods, to enhance the transparency of machine learning models. CO2: Students will possess the ability to evaluate and compare different interpretability methods. CO3: Students will develop a critical understanding of the trade-offs between model complexity and interpretability in diverse scenarios. CO4: Demonstrating practical skills, students will apply explainability techniques to real-world datasets and challenges. CO5: Students will gain awareness of the ethical implications related to AI transparency and interpretability.

Essential Reading

Michael Munn, David Pitman, Explainable AI for Practitioners, O'Reilly Media, Inc. , 2022
Leonida Gianfagna, Antonio Di Cecco, Explainable AI with Python, Springer , 2021

Supplementary Reading

Christoph Molnar, Interpretable Machine Learning, A Guide for Making Black Box Models Explainable, Leanpub , 2023
Denis Rothman, Hands-On Explainable AI (XAI) with Python: Interpret, visualize, explain, and integrate reliable AI for fair, secure, and trustworthy AI apps, Packt Publishing Limited , 2020

Journal and Conferences

Explainable Artificial Intelligence, ELSEVIER
AAAI Conference on Artificial Intelligence

National Institute of Technology Rourkela

राष्ट्रीय प्रौद्योगिकी संस्थान राउरकेला

ଜାତୀୟ ପ୍ରଯୁକ୍ତି ପ୍ରତିଷ୍ଠାନ ରାଉରକେଲା

About NITR

Administration

Key Documents

Accreditation

Academic

Technical Service Units

Departments

Academic Support Centres

Centres of Research

Facilities

Prospective Students

Existing Students

Other Related Links

Prospective Employee

Existing Employee

Useful Links

Retired Employee

Syllabus

Course Details

Syllabus

Course Objectives

Course Outcomes

Essential Reading

Supplementary Reading

Journal and Conferences

RELATED LINKS