What is Explainable AI?
Artificial intelligence is becoming more complex and increasingly deployed across society, which makes explainability ever more crucial.
IBM provides a simple but effective definition for XAI:
“Explainable artificial intelligence (XAI) is a set of processes and methods that allows human users to comprehend and trust the results and output created by machine learning algorithms.”
XAI helps describe an AI model, its expected impact and potential biases. All
of this leads to better model accuracy, fairness, transparency and outcomes
when AI is used for data-driven decision making.
Explainability is critical as AI algorithms take control of more applications and sectors, which brings with it the risk of bias, faulty algorithms, and various other issues. By ensuring transparency through explainability, your company can truly leverage the power of AI.
Explainable AI is not just one single tool but rather a set of tools and frameworks that help
you, your company and the public understand and interpret predictions made by machine
learning models.
Decisions made by AI models are still not completely explainable in mathematical terms, because there is not enough explicit declarative knowledge about how they are reached. Neural network models, for example, operate on high-dimensional vectors that are unintelligible to humans. A supervised machine learning algorithm takes an input X and an output Y and learns a mapping between them; the model derives rules from this mapping that are then used for future predictions, as illustrated in the figures below.
Fig.: A simple neural network.
A deep neural network contains many hidden layers, nodes, and weights. During training, the weights are adjusted continuously, by comparing the model's generated outputs with the original outputs, until a high performance score is achieved.
Fig.: Neural network: the loss score is used as a feedback signal to adjust the weights.
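As a hedged illustration of this feedback loop (not any specific architecture from the figures), the sketch below trains a tiny one-hidden-layer network with plain NumPy gradient descent; the toy data, layer sizes, and learning rate are arbitrary choices made only for the example.

```python
import numpy as np

# Toy data: X are inputs, y are the "original" (true) outputs.
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 3))
y = (X @ np.array([1.5, -2.0, 0.5]) > 0).astype(float)

# One hidden layer with sigmoid activations (sizes chosen arbitrarily).
W1 = rng.normal(scale=0.1, size=(3, 8))
W2 = rng.normal(scale=0.1, size=(8, 1))
sigmoid = lambda z: 1.0 / (1.0 + np.exp(-z))

lr = 0.5
for epoch in range(500):
    # Forward pass: model-generated outputs.
    h = sigmoid(X @ W1)
    y_hat = sigmoid(h @ W2).ravel()

    # Loss score: compare generated outputs with the original outputs.
    loss = np.mean((y_hat - y) ** 2)

    # Backward pass: the loss is the feedback signal that adjusts the weights.
    grad_out = (2.0 / len(y)) * (y_hat - y) * y_hat * (1 - y_hat)
    grad_W2 = h.T @ grad_out[:, None]
    grad_h = grad_out[:, None] @ W2.T * h * (1 - h)
    grad_W1 = X.T @ grad_h
    W2 -= lr * grad_W2
    W1 -= lr * grad_W1

print(f"final loss: {loss:.4f}")
```

The weights converge as the loss, fed back after each pass, keeps shrinking the gap between generated and original outputs.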
These machine learning and deep learning models lack transparency in their decision making, especially on high-dimensional inputs; the problem is that the decisions come with no explanation. Such black-box models cannot tell why they made a particular decision, and it is up to the user to take it or leave it. In certain cases, however, the user needs to know the reason why a decision was made, for example where a decision can save or cost someone's life. Here explainability plays a key role: a decision is far more reliable if the model can generate a satisfactory explanation for it; otherwise it is hard to trust AI in such cases.
An explanation is the answer to a why-question (Miller 2017):
Why did the treatment not work on the patient?
Why was my loan rejected?
Why was this email marked as spam?
How explainable AI works
With explainable AI – as well as interpretable machine learning – organizations can gain access to
AI technology’s underlying decision-making and are empowered to make adjustments. Explainable
AI can improve the user experience of a product or service by helping the end user trust that the AI
is making good decisions. When do AI systems give enough confidence in the decision that you can
trust it, and how can the AI system correct errors that arise?⁴
As AI becomes more advanced, ML processes still need to be understood and controlled to ensure
AI model results are accurate. Let’s look at the difference between AI and XAI, the methods and
techniques used to turn AI into XAI, and the difference between interpreting and explaining AI processes.
Comparing AI and XAI
What exactly is the difference between “regular” AI and explainable AI? XAI
implements specific techniques and methods to ensure that each decision made
during the ML process can be traced and explained. AI, on the other hand, often
arrives at a result using an ML algorithm, but the architects of the AI systems do not
fully understand how the algorithm reached that result. This makes it hard to check
for accuracy and leads to loss of control, accountability and auditability.
The Need for XAI
For the Sake of Social Responsibility, Fairness and Risk Avoidance.
Especially within healthcare, clinical, and justice work, risk and responsibility are major concerns, as these fields potentially deal with human lives and not merely cost-benefit analyses. Risk avoidance matters because responsibility is assigned to the individual professional. Hence, there is a need to develop mental models for expert (e.g. clinical) reasoning in order to better understand the reasoning behind deep neural networks and other opaque models.
Generate Accountable, Reliable and Sound Models for Justification.
A theme that has drawn great attention to XAI is the possibility of ensuring fair and unbiased models by auditing them or producing proof of their correctness. Proponents of this approach argue that XAI provides the results required to audit algorithms and offers a provable way of defending algorithmic decisions as fair and ethical. Hence, building algorithms that are not only fair and socially responsible, but also accountable and able to justify their output, is another aspect motivating the need for XAI.
Minimize Biases and Misinterpretation in Model Performance and Interpretation.
Biases in models and their performance have proven to be an important driver for XAI, as shown by media coverage of models performing sub-par compared to humans, e.g. filtering out appropriate candidates in hiring processes or failing to recognize people of color. Especially when neural networks learn patterns from training data, biased training data becomes an issue that impacts the validity of the model output.
Defining Interpretability, Explainability
Interpretability:
We consider a model intrinsically interpretable, if a human can understand the
internal workings of the model, either the entire model at once or at least the parts of
the model relevant for a given prediction. This may include understanding decision
rules and cutoffs and the ability to manually derive the outputs of the model. For
example, the scorecard for the recidivism model can be considered interpretable, as it
is compact and simple enough to be fully understood. Ideally, we even understand the
learning algorithm well enough to understand how the model’s decision boundaries
were derived from the training data — that is, we may not only understand a model’s
rules, but also why the model has these rules.
Explainability:
We consider a model explainable if we find a mechanism to provide (partial)
information about the workings of the model, such as identifying influential features.
We consider a model’s prediction explainable if a mechanism can provide (partial)
information about the prediction, such as identifying which parts of an input were
most important for the resulting prediction or which changes to an input would result
in a different prediction. For example, for the proprietary COMPAS model for
recidivism prediction, an explanation may indicate that the model heavily relies on
the age, but not the gender of the accused; for a single prediction made to assess the
recidivism risk of a person, an explanation may indicate that the large number of
prior arrests is the main reason behind the high risk score. Explanations can come in
many different forms, as text, as visualizations, or as examples. Explanations are
usually easy to derive from intrinsically interpretable models, but can also be provided for models whose internals humans may not understand. Explanations are
usually partial in nature and often approximated. The explanations may be divorced
from the actual internals used to make a decision; they are often called post-hoc
explanations.
How to think about explaining machine learning models
Before looking at specific techniques for explaining ML models, it will be helpful to
build a vocabulary that will help us think about what we can explain and how we may
go about it. This helps us consider what type of explanations we want and what methods are compatible with the model we have trained:
Intrinsically explainable - An intrinsically explainable model is designed to be simple and transparent enough that we can get a sense for how it works by looking at its structure, e.g. simple regression models and small decision trees. These models are directly interpretable; a short sketch below illustrates this.
Post-hoc explainable - For more complicated, already trained models, we can use explainability tools (often called interpretability tools) to obtain post-hoc explanations. Explanations of sufficiently complex models such as deep neural networks are always post-hoc explanations as they are not directly interpretable.
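To make the first category concrete, the hedged sketch below fits a shallow decision tree with scikit-learn and prints its decision rules, which a human can read directly; a deep neural network trained on the same data would offer no such readable structure and would need post-hoc tools instead. The dataset and tree depth are arbitrary choices for illustration.

```python
from sklearn.datasets import load_wine
from sklearn.tree import DecisionTreeClassifier, export_text

# A small, intrinsically interpretable model: a depth-2 decision tree.
data = load_wine()
tree = DecisionTreeClassifier(max_depth=2, random_state=0).fit(data.data, data.target)

# The complete set of decision rules and cutoffs fits on a few lines,
# so the model can be understood simply by reading its structure.
print(export_text(tree, feature_names=list(data.feature_names)))
```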
The types of ‘explanations’ typically fall into one of two categories:
Global explanations - A global explanation of a ML model details what features are important to the model overall. This can be measured by looking at effect sizes or determining which features have the biggest impact on model accuracy. Global explanations are helpful for guiding policy or finding evidence for, or rejecting, a hypothesis that a particular feature is important. Figure (4) shows a visualisation of a global explanation for a wine classification task.
Figure 4. An example of a global explanation for a multi-class classification problem. The size of the horizontal bars indicates how much each feature (on average) influences the classification of a wine into one of three classes.
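As a hedged sketch of how such a global explanation might be computed (not necessarily the method behind Figure 4), the example below uses scikit-learn's permutation importance to rank the wine features by their average impact on a classifier's accuracy; the model and its parameters are arbitrary choices.

```python
from sklearn.datasets import load_wine
from sklearn.ensemble import RandomForestClassifier
from sklearn.inspection import permutation_importance
from sklearn.model_selection import train_test_split

X, y = load_wine(return_X_y=True, as_frame=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

model = RandomForestClassifier(n_estimators=200, random_state=0).fit(X_train, y_train)

# Global explanation: how much does shuffling each feature hurt accuracy overall?
result = permutation_importance(model, X_test, y_test, n_repeats=10, random_state=0)
for name, score in sorted(zip(X.columns, result.importances_mean), key=lambda t: -t[1]):
    print(f"{name:30s} {score:+.3f}")
```

The ranked scores play the role of the bar lengths in a plot like Figure 4.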
Local explanations - A local explanation details how a ML model arrived at a specific prediction. For tabular data, it could be a list of features with their impact on the prediction. For a computer vision task, it might be a subset of pixels that had the biggest impact on the classification. Figure (5) shows an example of some local explanations for model predictions for three unique instances. Local explanations are useful for deep-dive insights or diagnosing issues and can provide answers to questions like:
o Why did the model return this output for this input?
o What if this feature had a different value?
Figure 5. Local explanations of the predictions for three different instances in a wine classification task. The local explanations give the direction and magnitude of the effect each feature has on the model output relative to the baseline.
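The sketch below gives a deliberately simple, hedged version of a local explanation: for a logistic regression on the wine data, each feature's contribution to one prediction is approximated as its coefficient times the feature's standardized deviation from the training mean. Dedicated tools such as SHAP or LIME produce more principled local attributions; this is only meant to illustrate the idea.

```python
from sklearn.datasets import load_wine
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

X, y = load_wine(return_X_y=True, as_frame=True)
model = make_pipeline(StandardScaler(), LogisticRegression(max_iter=5000)).fit(X, y)

# Explain a single instance: which features push its score up or down,
# relative to a baseline of "an average wine" (the training mean)?
x = X.iloc[[0]]
pred_class = model.predict(x)[0]

scaler = model.named_steps["standardscaler"]
clf = model.named_steps["logisticregression"]
z = scaler.transform(x)[0]            # standardized feature values (mean wine = 0)
# Wine classes are 0, 1, 2, so the label doubles as the row index into coef_.
contrib = clf.coef_[pred_class] * z   # per-feature contribution to the class score

for name, c in sorted(zip(X.columns, contrib), key=lambda t: -abs(t[1]))[:5]:
    print(f"{name:30s} {c:+.2f}")
print("predicted class:", pred_class)
```

Each signed value gives the direction and magnitude of a feature's effect on the score for this one instance, much like one column of Figure 5.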
Some methods are a mix of both, providing detailed explanations of how a single feature or interaction of two features impacts a set of predictions. These are modular global explanations because they can only be used to inspect the impact of one or two features at a time.
The methods used to provide explanations are either model-specific or model-agnostic:
Model-specific - Model-specific methods work by inspecting or having access
to the model internals. Interpreting regression coefficient weights or P-values in
a linear model or counting the number of times a feature is used in an ensemble
tree model are examples of model-specific methods.
Model-agnostic - Model-agnostic methods work by investigating the relationship between input-output pairs of trained models. They do not depend on the internal structure of the model. These methods are very useful when we have no theory or other mechanism to interpret what is happening inside the model.
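To make the distinction concrete, the hedged sketch below contrasts a model-specific view (a random forest's built-in impurity-based feature importances, read from the model internals) with a model-agnostic probe (shuffling one input column and measuring how much the test accuracy drops, the same idea as the permutation importance used above); the dataset, model, and chosen feature are arbitrary.

```python
import numpy as np
from sklearn.datasets import load_wine
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split

X, y = load_wine(return_X_y=True, as_frame=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)
model = RandomForestClassifier(random_state=0).fit(X_train, y_train)

# Model-specific: read the importances straight out of the model internals.
imp = dict(zip(X.columns, model.feature_importances_))
print(f"impurity-based importance of 'proline': {imp['proline']:.3f}")

# Model-agnostic: only query the input-output behaviour of the fitted model.
rng = np.random.default_rng(0)
X_shuffled = X_test.copy()
X_shuffled["proline"] = rng.permutation(X_shuffled["proline"].to_numpy())
drop = model.score(X_test, y_test) - model.score(X_shuffled, y_test)
print(f"accuracy drop when 'proline' is shuffled: {drop:.3f}")
```

The second measurement would work unchanged for any classifier, because it never looks inside the model.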
Properties of Explanations
We want to explain the predictions of a machine learning model. To achieve this,
we rely on some explanation method, which is an algorithm that generates explanations. An explanation usually relates the feature values of an instance to its
model prediction in a humanly understandable way.
Properties of Explanation Methods
Expressive Power is the “language” or structure of the explanations the method is
able to generate. An explanation method could generate IF-THEN rules, decision
trees, a weighted sum, natural language or something else.
Translucency describes how much the explanation method relies on looking into the machine learning model, like its parameters. For example, explanation methods relying on intrinsically interpretable models like the linear regression model (model-specific) are highly translucent. Methods only relying on manipulating inputs and observing the predictions have zero translucency. Depending on the scenario, different levels of translucency might be desirable. The advantage of high translucency is that the method can rely on more information to generate explanations. The advantage of low translucency is that the explanation method is more portable.
Portability describes the range of machine learning models with which the explanation method can be used. Methods with a low translucency have a higher portability because they treat the machine learning model as a black box. Surrogate models might be the explanation method with the highest portability. Methods that only work for e.g. recurrent neural networks have low portability.
Algorithmic Complexity describes the computational complexity of the method that generates the explanation. This property is important to consider when computation time is a bottleneck in generating explanations.
Properties of Individual Explanations
Accuracy: How well does an explanation predict unseen data? High accuracy is especially important if the explanation is used for predictions in place of the machine learning model. Low accuracy can be fine if the accuracy of the machine learning model is also low, and if the goal is to explain what the black box model does. In this case, only fidelity is important.
Fidelity: How well does the explanation approximate the prediction of the black box model? High fidelity is one of the most important properties of an explanation, because an explanation with low fidelity is useless to explain the machine learning model. Accuracy and fidelity are closely related. If the black box model has high accuracy and the explanation has high fidelity, the explanation also has high accuracy. Some explanations offer only local fidelity, meaning the explanation only approximates the model prediction well for a subset of the data (e.g. local surrogate models) or even for only an individual data instance (e.g. Shapley Values).
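A hedged sketch of the distinction between accuracy and fidelity: below, a shallow decision tree is fitted as a global surrogate for a random forest "black box". Its fidelity is how often it agrees with the forest's predictions, while its accuracy is how often it matches the true labels. The models and the tree depth are arbitrary choices.

```python
from sklearn.datasets import load_wine
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

X, y = load_wine(return_X_y=True, as_frame=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

black_box = RandomForestClassifier(random_state=0).fit(X_train, y_train)

# Surrogate explanation: a shallow tree trained to mimic the black box's predictions.
surrogate = DecisionTreeClassifier(max_depth=3, random_state=0)
surrogate.fit(X_train, black_box.predict(X_train))

bb_pred = black_box.predict(X_test)
sg_pred = surrogate.predict(X_test)

print("fidelity (agreement with the black box):", accuracy_score(bb_pred, sg_pred))
print("accuracy (agreement with the true labels):", accuracy_score(y_test, sg_pred))
```

A surrogate can have high fidelity yet modest accuracy whenever the black box itself is wrong on the same instances.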
Consistency: How much does an explanation differ between models that have been
trained on the same task and that produce similar predictions? For example, I train
a support vector machine and a linear regression model on the same task and both
produce very similar predictions. I compute explanations using a method of my
choice and analyze how different the explanations are. If the explanations are very
similar, the explanations are highly consistent. I find this property somewhat tricky,
since the two models could use different features, but get similar predictions (also
called “Rashomon Effect”). In this case a high consistency is not desirable because
the explanations have to be very different. High consistency is desirable if the
models really rely on similar relationships.
Stability: How similar are the explanations for similar instances? While consistency compares explanations between models, stability compares explanations between similar instances for a fixed model. High stability means that slight variations in the features of an instance do not substantially change the explanation (unless these slight variations also strongly change the prediction). A lack of stability can be the result of a high variance of the explanation method. In other words, the explanation method is strongly affected by slight changes of the feature values of the instance to be explained. A lack of stability can also be caused by non-deterministic components of the explanation method, such as a data sampling step, like the local surrogate method uses. High stability is always desirable.
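As a rough, hedged way to probe stability, the sketch below perturbs one instance's features slightly and checks how much a simple coefficient-based local attribution (as in the earlier local-explanation sketch) changes; a stable method should yield nearly identical attributions for the two versions of the instance.

```python
import numpy as np
from sklearn.datasets import load_wine
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

X, y = load_wine(return_X_y=True, as_frame=True)
model = make_pipeline(StandardScaler(), LogisticRegression(max_iter=5000)).fit(X, y)
scaler = model.named_steps["standardscaler"]
clf = model.named_steps["logisticregression"]

def local_attribution(x_row):
    """Coefficient-times-standardized-value attribution for the predicted class."""
    c = model.predict(x_row)[0]       # wine labels 0, 1, 2 double as coef_ row indices
    return clf.coef_[c] * scaler.transform(x_row)[0]

x = X.iloc[[0]]
x_perturbed = x + 0.01 * X.std()      # a slight variation of the same instance

a, b = local_attribution(x), local_attribution(x_perturbed)
print(f"max change in attribution: {np.abs(a - b).max():.4f}")
```

For this deterministic, coefficient-based method the change is tiny; sampling-based explainers would typically show larger differences across reruns.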
Comprehensibility: How well do humans understand the explanations? This looks just like one more property among many, but it is the elephant in the room. Difficult to define and measure, but extremely important to get right. Many people agree that comprehensibility depends on the audience. Ideas for measuring comprehensibility include measuring the size of the explanation (number of features with a non-zero weight in a linear model, number of decision rules, …) or testing how well people can predict the behavior of the machine learning model from the explanations. The comprehensibility of the features used in the explanation should also be considered. A complex transformation of features might be less comprehensible than the original features.
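One of the proxies suggested above, the size of the explanation, can be computed directly; the hedged sketch below counts the features with non-zero weight in an L1-regularized logistic regression on the wine data, where a stronger penalty yields a smaller and arguably more comprehensible explanation. The penalty strengths are arbitrary choices.

```python
import numpy as np
from sklearn.datasets import load_wine
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

X, y = load_wine(return_X_y=True, as_frame=True)

for C in (1.0, 0.05):
    clf = make_pipeline(
        StandardScaler(),
        LogisticRegression(penalty="l1", solver="saga", C=C, max_iter=5000),
    ).fit(X, y)
    coef = clf.named_steps["logisticregression"].coef_
    # Explanation size: how many features carry any non-zero weight for any class?
    n_used = np.count_nonzero(np.abs(coef).sum(axis=0))
    print(f"C={C}: explanation uses {n_used} of {X.shape[1]} features")
```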
Certainty: Does the explanation reflect the certainty of the machine learning model? Many machine learning models only give predictions without a statement about the model's confidence that the prediction is correct. If the model predicts a 4% probability of cancer for one patient, is it as certain as the 4% probability that another patient, with different feature values, received? An explanation that includes the model's certainty is very useful.
Degree of Importance: How well does the explanation reflect the importance of
features or parts of the explanation? For example, if a decision rule is generated as
an explanation for an individual prediction, is it clear which of the conditions of the
rule was the most important?
Novelty: Does the explanation reflect whether a data instance to be explained
comes from a “new” region far removed from the distribution of training data? In
such cases, the model may be inaccurate and the explanation may be useless. The
concept of novelty is related to the concept of certainty. The higher the novelty, the
more likely it is that the model will have low certainty due to lack of data.
Representativeness: How many instances does an explanation cover? Explanations
can cover the entire model (e.g. interpretation of weights in a linear regression
model) or represent only an individual prediction (e.g. Shapley Values).