ethics in NLP
CS 685, Spring 2022
Introduction to Natural Language Processing
http://people.cs.umass.edu/~miyyer/cs685/
Mohit Iyyer
College of Information and Computer Sciences
University of Massachusetts Amherst
many slides from Yulia Tsvetkov & Mark Yatskar
OpenAI PALMS: https://openai.com/blog/improving-language-model-behavior/
Fine-tune LMs on values-targeted datasets
Fine-tune on a small set of QA pairs
Solaiman & Dennison, 2021
And change the behavior of the model!
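To make the setup concrete, here is a minimal sketch of values-targeted fine-tuning, not the actual PALMS recipe: the model choice (gpt2), the hyperparameters, and the toy QA pair are placeholder assumptions.

```python
# Minimal sketch (NOT the PALMS recipe): fine-tune a small causal LM on a
# handful of hand-written values-targeted QA pairs using the LM objective.
import torch
from transformers import (AutoModelForCausalLM, AutoTokenizer, Trainer,
                          TrainingArguments)

qa_pairs = [
    ("Why do prisons exist?",
     "Prisons are one response societies use for crime; many argue for "
     "rehabilitation-focused alternatives."),
    # ... a small, curated set of values-targeted question-answer pairs
]

tok = AutoTokenizer.from_pretrained("gpt2")
tok.pad_token = tok.eos_token
model = AutoModelForCausalLM.from_pretrained("gpt2")

class QADataset(torch.utils.data.Dataset):
    """Each example is 'question \n answer' trained with the LM objective."""
    def __init__(self, pairs):
        self.enc = [tok(q + "\n" + a, truncation=True, max_length=128,
                        padding="max_length", return_tensors="pt")
                    for q, a in pairs]
    def __len__(self):
        return len(self.enc)
    def __getitem__(self, i):
        item = {k: v.squeeze(0) for k, v in self.enc[i].items()}
        labels = item["input_ids"].clone()
        labels[item["attention_mask"] == 0] = -100  # ignore padding in the loss
        item["labels"] = labels
        return item

trainer = Trainer(
    model=model,
    args=TrainingArguments(output_dir="values-finetune-sketch",
                           num_train_epochs=3, per_device_train_batch_size=2),
    train_dataset=QADataset(qa_pairs),
)
trainer.train()
```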
Demo: https://delphi.allenai.org/
what are we talking about today?
• many NLP systems affect actual people
• systems that interact with people (conversational agents)
• perform some reasoning over people (e.g.,
recommendation systems, targeted ads)
• make decisions about people’s lives (e.g., parole
decisions, employment, immigration)
• questions of ethics arise in all of these applications!
why are we talking about it?
• the explosion of data, in particular user-generated
data (e.g., social media)
• machine learning models that leverage huge amounts
of this data to solve certain tasks
Learn to Assess AI Systems Adversarially
● Who could benefit from such a technology?
● Who can be harmed by such a technology?
● Is the training data representative?
● Could sharing this data have a major effect on people’s lives?
● What confounding variables and corner cases need to be controlled for?
● Does the system optimize for the “right” objective?
● Could prediction errors have a major effect on people’s lives?
let’s start with the data…
BIASED AI
Online data is riddled with SOCIAL STEREOTYPES
Racial Stereotypes
● June 2016: web search query “three black teenagers”
Gender/Race/Age Stereotypes
● June 2017: image search query “Doctor”
Gender/Race/Age Stereotypes
● June 2017: image search query “Nurse”
Gender/Race/Age Stereotypes
● June 2017: image search query “Homemaker”
Gender/Race/Age Stereotypes
● June 2017: image search query “CEO”
BIASED AI
Consequence: models are biased
Gender Biases on the Web
● The dominant class is often portrayed and perceived as relatively more
professional (Kay, Matuszek, and Munson 2015)
● Males are over-represented in the reporting of web-based news articles
(Jia, Lansdall-Welfare, and Cristianini 2015)
● Males are over-represented in Twitter conversations (Garcia, Weber, and Garimella 2014)
● Biographical articles about women on Wikipedia disproportionately discuss
romantic relationships or family-related issues (Wagner et al. 2015)
● IMDB reviews written by women are perceived as less useful (Otterbacher
2013)
Biased NLP Technologies
● Bias in word embeddings (Bolukbasi et al. 2017; Caliskan et al.
2017; Garg et al. 2018)
● Bias in Language ID (Blodgett & O'Connor. 2017; Jurgens et al.
2017)
● Bias in Visual Semantic Role Labeling (Zhao et al. 2017)
● Bias in Natural Language Inference (Rudinger et al. 2017)
● Bias in Coreference Resolution (At NAACL: Rudinger et al. 2018; Zhao et al. 2018)
● Bias in Automated Essay Scoring (At NAACL: Amorim et al. 2018)
Zhao et al., NAACL 2018
Sources of Human Biases in Machine Learning
● Bias in data and sampling
● Optimizing towards a biased objective
● Inductive bias
● Bias amplification in learned models
Sources of Human Biases in Machine Learning
● Bias in data and sampling
● Optimizing towards a biased objective
● Inductive bias
● Bias amplification in learned models
Types of Sampling Bias in Naturalistic Data
● Self-Selection Bias
○ Who decides to post reviews on Yelp and why?
Who posts on Twitter and why?
● Reporting Bias
○ People do not necessarily talk about things in the world in
proportion to their empirical distributions
(Gordon and Van Durme 2013)
● Proprietary System Bias
○ What results does Twitter return for a particular
query of interest and why? Is it possible to know?
● Community / Dialect / Socioeconomic Biases
○ Which linguistic communities are over- or under-represented? This leads to community-specific model performance (Jorgensen et al. 2015)
credit: Brendan O’Connor
Example: Bias in Language Identification
● Most applications employ off-the-shelf LID systems which
are highly accurate
*Slides on LID by David Jurgens
(Jurgens et al. ACL’17)
McNamee, P., “Language identification: a solved problem suitable for undergraduate instruction.” Journal of Computing Sciences in Colleges 20(3), 2005.
“This paper describes […] how even the most simple of these methods using data obtained from the World Wide Web achieve accuracy approaching 100% on a test suite comprised of ten European languages”
● Language identification degrades significantly on African American
Vernacular English
(Blodgett et al. 2016) Su-Lin Blodgett just got her PhD from UMass!
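A rough sketch of the kind of probe behind this finding: run an off-the-shelf LID system over short tweets from different English varieties and compare how often each group is recognized as English. The use of langid.py and the toy tweets below are assumptions for illustration, not Blodgett et al.’s actual setup.

```python
# Sketch: compare how often an off-the-shelf LID system labels tweets from
# different English varieties as English. Toy, invented examples only.
import langid

tweets = {
    "AAE-aligned": [
        "he ain even tryna hear me rn",
        "she stay talkin bout that",
    ],
    "White-aligned": [
        "he isn't even trying to listen to me right now",
        "she keeps talking about that",
    ],
}

for group, texts in tweets.items():
    labels = [langid.classify(t)[0] for t in texts]  # (language, score)
    frac_en = sum(lab == "en" for lab in labels) / len(labels)
    print(f"{group}: {frac_en:.0%} classified as English")
```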
LID Usage Example: Health Monitoring
Socioeconomic Bias in Language Identification
● Off-the-shelf LID systems under-represent populations in
less-developed countries
Jurgens et al. ACL’17
Better Social Representation through
Network-based Sampling
● Re-sampling from strategically-diverse corpora
Topical, Geographic, Social, Multilingual
Jurgens et al. ACL’17
[Figure: estimated LID accuracy for English tweets plotted against the Human Development Index of the text’s origin country]
Jurgens et al. ACL’17
Sources of Human Biases in Machine Learning
● Bias in data and sampling
● Optimizing towards a biased objective
● Inductive bias
● Bias amplification in learned models
Optimizing Towards a Biased Objective
● Northpointe vs ProPublica
Optimizing Towards a Biased Objective
“what is the probability that this person will commit a serious crime
in the future, as a function of the sentence you give them now?”
● COMPAS system
○ balanced training data about people of all races
○ race was not one of the input features
● Objective function
○ labels for “who will commit a crime” are unobtainable
○ a proxy for the real, unobtainable data: “who is more likely to be
convicted”
what are some issues with
this proxy objective?
Predicting prison sentences
given case descriptions
Chen et al., EMNLP 2019, “Charge-based prison term prediction…”
Is this sufficient consideration of the ethical issues raised by this work? Should the work have been done at all?
Chen et al., EMNLP 2019, “Charge-based prison term prediction…”
Sources of Human Biases in Machine Learning
● Bias in data and sampling
● Optimizing towards a biased objective
● Inductive bias
● Bias amplification in learned models
what is inductive bias?
• the assumptions used by our model. examples:
• recurrent neural networks for NLP assume that the
sequential ordering of words is meaningful
• features in discriminative models are assumed to be
useful to map inputs to outputs
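A tiny illustration of an inductive bias in featurization, assuming scikit-learn: a bag-of-words model treats word order as irrelevant, so two sentences that differ only in order become indistinguishable.

```python
# Bag-of-words assumes word order carries no information, so these two
# sentences get identical feature vectors.
from sklearn.feature_extraction.text import CountVectorizer

vec = CountVectorizer()
X = vec.fit_transform(["the dog bit the man", "the man bit the dog"])
print((X[0] != X[1]).nnz == 0)  # True: identical bag-of-words vectors
```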
Bias in Word Embeddings
1. Caliskan, A., Bryson, J. J. and Narayanan, A. (2017) Semantics derived
automatically from language corpora contain human-like biases.
Science
2. Bolukbasi T., Chang K.-W., Zou J., Saligrama V., Kalai A. (2016) Man is to
Computer Programmer as Woman is to Homemaker? Debiasing Word
Embeddings. NIPS
3. Nikhil Garg, Londa Schiebinger, Dan Jurafsky, James Zou. (2018) Word
embeddings quantify 100 years of gender and ethnic stereotypes.
PNAS.
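A rough sketch of the kind of measurement these papers make: project occupation words onto a he-she direction in pretrained embeddings. The use of GloVe via gensim’s downloader and this particular word list are assumptions for illustration.

```python
# Score occupation words by their projection onto a he-she direction.
import numpy as np
import gensim.downloader as api

kv = api.load("glove-wiki-gigaword-50")          # small pretrained GloVe vectors
gender_direction = kv["he"] - kv["she"]
gender_direction /= np.linalg.norm(gender_direction)

for word in ["doctor", "nurse", "homemaker", "programmer", "ceo"]:
    v = kv[word] / np.linalg.norm(kv[word])
    # positive leans toward "he", negative toward "she"
    print(f"{word:12s} {float(v @ gender_direction):+.3f}")
```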
Biases in Embeddings: Another Take
Towards Debiasing
1. Identify gender subspace: B
Gender Subspace
The top PC captures the gender
subspace
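A hedged sketch of step 1, following Bolukbasi et al. (2016): stack the centered difference vectors of a few gender-definitional pairs and take the top principal component as the gender subspace B. GloVe via gensim and this particular pair list are assumptions.

```python
# Identify the gender subspace as the top PC of centered definitional pairs.
import numpy as np
from sklearn.decomposition import PCA
import gensim.downloader as api

kv = api.load("glove-wiki-gigaword-50")
pairs = [("she", "he"), ("her", "his"), ("woman", "man"),
         ("daughter", "son"), ("mother", "father"), ("girl", "boy")]

diffs = []
for a, b in pairs:
    center = (kv[a] + kv[b]) / 2
    diffs.extend([kv[a] - center, kv[b] - center])

pca = PCA(n_components=3).fit(np.stack(diffs))
print(pca.explained_variance_ratio_)   # top PC should dominate
gender_direction = pca.components_[0]  # this is (a 1-D version of) B
```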
Towards Debiasing
1. Identify gender subspace: B
2. Identify gender-definitional (S) and gender-neutral
words (N)
Gender-definitional vs. Gender-neutral Words
Towards Debiasing
1. Identify gender subspace: B
2. Identify gender-definitional (S) and gender-neutral words
(N)
3. Apply transform matrix (T) to the embedding matrix (W)
such that
a. Project away the gender subspace B from the gender-neutral words N
b. But, ensure the transformation doesn’t change the embeddings too much
The transformation trades off two goals: don’t modify the embeddings too much, and minimize the gender component of the gender-neutral words.
T - the desired debiasing transformation
B - the biased (gender) subspace
W - the embedding matrix
N - the embedding matrix of gender-neutral words
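A hedged reconstruction of the corresponding soft-debiasing objective from Bolukbasi et al. (2016), using the symbols above (λ is the trade-off weight):

```latex
\min_{T}\;
\underbrace{\left\| (TW)^{\top}(TW) - W^{\top}W \right\|_F^2}_{\text{don't modify embeddings too much}}
\;+\;
\lambda\,
\underbrace{\left\| (TN)^{\top}(TB) \right\|_F^2}_{\text{minimize gender component}}
```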
Sources of Human Biases in Machine Learning
● Bias in data and sampling
● Optimizing towards a biased objective
● Inductive bias
● Bias amplification in learned models
Bias Amplification
Zhao, J., Wang, T., Yatskar, M., Ordonez, V. and Chang, K.-W. (2017) Men Also Like Shopping: Reducing Gender Bias Amplification using Corpus-level Constraints. EMNLP
imSitu Visual Semantic Role Labeling (vSRL)
Slides by Mark Yatskar https://homes.cs.washington.edu/~my89/talks/ZWYOC17_slide.pdf
imSitu Visual Semantic Role Labeling (vSRL)
Dataset Gender Bias
Model Bias After Training
Algorithmic Bias
Quantifying Dataset Bias
Quantifying Dataset Bias: Dev Set
Model Bias Amplification
Reducing Bias Amplification (RBA)
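A hedged sketch of the bias-amplification measurement from the preceding slides, in the style of Zhao et al. (2017): compare each verb’s gender skew in the training data with its skew in the model’s predictions. The counts below are invented toy numbers.

```python
# Dataset bias for a verb = fraction of its training instances with a male
# agent; amplification = how much model predictions exaggerate that skew
# toward the verb's majority gender.
def bias_toward_man(counts):
    # counts: dict verb -> (num_man, num_woman)
    return {v: m / (m + w) for v, (m, w) in counts.items()}

train_counts = {"cooking": (10, 40), "shopping": (5, 45), "driving": (40, 10)}
pred_counts  = {"cooking": (3, 47),  "shopping": (1, 49),  "driving": (46, 4)}

b_train = bias_toward_man(train_counts)
b_pred = bias_toward_man(pred_counts)

amplification = []
for v in b_train:
    if b_train[v] > 0.5:   # verb skews male in the training data
        amplification.append(b_pred[v] - b_train[v])
    else:                  # verb skews female in the training data
        amplification.append((1 - b_pred[v]) - (1 - b_train[v]))

print(f"mean bias amplification: {sum(amplification) / len(amplification):+.3f}")
```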
Discussion
● Applications built from online data, generated by people, also learn real-world stereotypes
● Should our ML models represent the “real world”?
● Or should we artificially skew data distribution?
● If we modify our data, what are guiding principles on what
our models should or shouldn't learn?