Glossary of Terms
AUC-ROC (Area Under the Receiver Operating Characteristic Curve): A performance measurement for classification models at various threshold settings. AUC represents the degree of separability between classes.
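A minimal sketch of computing AUC-ROC with scikit-learn; the labels and scores below are illustrative, and any arrays of true labels and predicted probabilities work the same way:

```python
from sklearn.metrics import roc_auc_score

# True binary labels and the model's predicted probabilities for the positive class
y_true = [0, 0, 1, 1, 0, 1]
y_score = [0.1, 0.4, 0.35, 0.8, 0.2, 0.7]

# 1.0 means perfect separation of the classes; 0.5 means no better than chance
print(roc_auc_score(y_true, y_score))
```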
Accuracy: The proportion of correct predictions made by a model out of the total number of predictions.
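In terms of confusion-matrix counts, this is:

Accuracy = (TP + TN) / (TP + TN + FP + FN)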
Agentic AI: Artificial intelligence systems that can make autonomous decisions based on goals, feedback, and context, similar to how a human agent would operate.
Bias (in AI): A systematic error in a model that leads to unfair outcomes for certain groups, often caused by historical data or skewed training sets.
Bootstrapping: A resampling method used to estimate the uncertainty of a statistic by repeatedly sampling from the original dataset (with replacement) to create many new datasets. The statistic (e.g., mean or standard deviation) is calculated for each resample, allowing a probability distribution to be built and uncertainty to be assessed.
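A minimal NumPy sketch of bootstrapping the mean; the sample values and the number of resamples are illustrative:

```python
import numpy as np

rng = np.random.default_rng(42)
data = np.array([4.2, 5.1, 3.8, 6.0, 4.9, 5.5, 4.4])

# Draw 10,000 resamples (with replacement) and compute the mean of each
boot_means = [rng.choice(data, size=len(data), replace=True).mean()
              for _ in range(10_000)]

# The spread of the resampled means approximates the uncertainty of the estimate
print(np.mean(boot_means), np.percentile(boot_means, [2.5, 97.5]))
```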
Confusion Matrix: A table used to describe the performance of a classification model, showing the true positives, true negatives, false positives, and false negatives.
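A minimal sketch with scikit-learn and illustrative labels; for binary classes, scikit-learn places true negatives in the top-left:

```python
from sklearn.metrics import confusion_matrix

y_true = [0, 1, 1, 0, 1, 0, 1]
y_pred = [0, 1, 0, 0, 1, 1, 1]

# Rows are actual classes, columns are predicted classes:
# [[TN, FP],
#  [FN, TP]]
print(confusion_matrix(y_true, y_pred))
```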
Credit Utilization: The ratio of a borrower's current credit card balances to their credit limits, often used as an indicator of credit risk.
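For example, a borrower carrying $3,000 in balances against $10,000 in total limits has a utilization of 3,000 / 10,000 = 30%.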
Data Imputation: The process of replacing missing or incomplete data with substituted values to maintain the integrity of the dataset.
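A minimal sketch using scikit-learn's SimpleImputer to fill missing values with the column median; the data and the choice of strategy are illustrative:

```python
import numpy as np
from sklearn.impute import SimpleImputer

X = np.array([[1.0, 2.0],
              [np.nan, 3.0],
              [7.0, np.nan]])

# Replace each NaN with the median of its column
imputer = SimpleImputer(strategy="median")
print(imputer.fit_transform(X))
```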
Decision Tree: A machine learning model that splits data into branches to reach a decision based on input variables. It's valued for its interpretability.
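A minimal sketch that fits a shallow tree and prints its rules as readable if/else text, which is where the interpretability comes from; the dataset and depth are illustrative:

```python
from sklearn.datasets import load_iris
from sklearn.tree import DecisionTreeClassifier, export_text

X, y = load_iris(return_X_y=True)

# A shallow tree keeps the rule set small enough to read
tree = DecisionTreeClassifier(max_depth=2, random_state=0).fit(X, y)
print(export_text(tree, feature_names=load_iris().feature_names))
```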
Delinquency: The failure to make required debt payments on time, typically used in credit risk assessments.
Demographic parity: A fairness metric that is satisfied if the results of a model's classification are not dependent on a given sensitive attribute.
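Formally, for a predicted label Ŷ and a sensitive attribute A, demographic parity requires:

P(Ŷ = 1 | A = a) = P(Ŷ = 1 | A = b) for all groups a and b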
Disparate impact: A measure of how evenly positive outcomes are distributed across different groups. If one group gets positive results much less often than another, it may suggest unfair treatment or bias.
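Disparate impact is often quantified as the ratio of positive-outcome rates between groups; a common (though context-dependent) rule of thumb flags ratios below 0.8. A minimal NumPy sketch with illustrative arrays:

```python
import numpy as np

# 1 = positive outcome (e.g., loan approved); group labels are illustrative
outcomes = np.array([1, 0, 1, 1, 0, 1, 0, 0])
group = np.array(["A", "A", "A", "A", "B", "B", "B", "B"])

rate_a = outcomes[group == "A"].mean()
rate_b = outcomes[group == "B"].mean()

# Ratio of the less-favored group's rate to the other group's rate
print(min(rate_a, rate_b) / max(rate_a, rate_b))
```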
EDA (Exploratory Data Analysis): The process of analyzing datasets to summarize their main characteristics and uncover patterns before applying formal modeling.
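A minimal first-pass EDA sketch with pandas; the file name is hypothetical and the checks shown are typical starting points rather than a fixed recipe:

```python
import pandas as pd

df = pd.read_csv("accounts.csv")  # hypothetical input file

# Shape, column types, summary statistics, and missingness
print(df.shape)
print(df.dtypes)
print(df.describe())
print(df.isna().sum())
```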
F1 Score: The harmonic mean of precision and recall, providing a balance between the two metrics for evaluating classification models.
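Equivalently:

F1 = 2 · (Precision · Recall) / (Precision + Recall)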
Fairness (in AI): Ensuring that AI systems do not discriminate against individuals or groups and that decisions are equitable.
Hyperparameter Tuning: The process of optimizing a model's settings to improve its performance, such as adjusting tree depth or learning rate.
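A minimal sketch of a grid search over tree depth and learning rate with scikit-learn; the estimator, the grid values, and the scoring metric are illustrative:

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import GradientBoostingClassifier
from sklearn.model_selection import GridSearchCV

X, y = make_classification(n_samples=500, random_state=0)

# Try every combination of the listed settings with 5-fold cross-validation
grid = GridSearchCV(
    GradientBoostingClassifier(random_state=0),
    param_grid={"max_depth": [2, 3, 4], "learning_rate": [0.05, 0.1]},
    cv=5,
    scoring="roc_auc",
)
grid.fit(X, y)
print(grid.best_params_, grid.best_score_)
```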
Imbalanced Data: A situation where the distribution of classes in a dataset is highly skewed, with one class significantly outnumbering the other(s).
Imputation: In statistics, the process of replacing missing values with substituted values.
Logistic Regression: A statistical model used for binary classification tasks, predicting the probability of one of two outcomes.
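The model passes a linear combination of the inputs through the logistic (sigmoid) function to produce a probability:

P(y = 1 | x) = 1 / (1 + e^−(β₀ + β₁x₁ + … + βₙxₙ))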
Missing Data: Instances where no data value is stored for a variable in an observation. These gaps can impact model quality. There are three different missing data mechanisms:
- Missing Completely at Random (MCAR): Data is considered MCAR when the reason for the missing values is unrelated to any other data in the dataset. The missingness happens by pure chance, without any pattern.
- Missing at Random (MAR): Data is considered MAR when the reason values are missing is related to other information in the dataset that isn't missing. If we know the values of some complete variables, we can explain why other values are missing.
- Missing Not at Random (MNAR): Data is considered MNAR when the reason it's missing is related to the missing value itself. In other words, the missingness depends on information we don't have.
Monte Carlo simulation: A mathematical technique that simulates the range of possible outcomes for an uncertain event.
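A minimal NumPy sketch that builds the distribution of an uncertain quantity by repeated random sampling; the default probability and loss distribution here are purely illustrative:

```python
import numpy as np

rng = np.random.default_rng(0)

# Simulate 100,000 scenarios: each of 50 accounts defaults with 5% probability,
# and each default costs an amount drawn from a lognormal distribution
defaults = rng.random((100_000, 50)) < 0.05
losses = (defaults * rng.lognormal(mean=8.0, sigma=0.5, size=(100_000, 50))).sum(axis=1)

# The simulated distribution gives the expected loss and the tail risk
print(losses.mean(), np.percentile(losses, 99))
```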
Precision: The percentage of true positive predictions among all positive predictions made by the model.
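In confusion-matrix terms:

Precision = TP / (TP + FP)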
Predictive Modeling: Using historical data and algorithms to forecast future outcomes, such as customer delinquency.
Recall: The percentage of actual positive cases that were correctly identified by the model.
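In confusion-matrix terms:

Recall = TP / (TP + FN)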
SHAP (Shapley Additive Explanations): A tool used to explain the output of machine learning models by assigning each feature an importance value.
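A minimal sketch using the shap package's TreeExplainer on a fitted tree-based model; the model and data are illustrative, and shap must be installed separately:

```python
import shap
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier

X, y = make_classification(n_samples=200, n_features=5, random_state=0)
model = RandomForestClassifier(random_state=0).fit(X, y)

# shap_values holds one importance value per feature for each prediction;
# together with a base value they add up to the model's output for that row
explainer = shap.TreeExplainer(model)
shap_values = explainer.shap_values(X)
```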
Synthetic Data: Artificially generated data that mimics real-world patterns and distributions, used when actual data is limited or sensitive.