Human Computation

lecture

Uploaded by

Jimmy Teng

[Introduction to mass collaboration], [Human computation],

[Open call], [Distributed data collection],

[Fragile Families Challenge]

Matthew J. Salganik
Department of Sociology
Princeton University
1) Introduction
2) Observing behavior
3) Asking questions
4) Running experiments
5) Mass collaboration
6) Ethics
7) The future
Fig 5.4 (Salganik 2018)
Human computation:
- Easy-task, big-scale problems where humans are better than computers
- Split-apply-combine strategy
- Human effort can be magnified with supervised learning
- Increasingly important as we move from numeric survey data to working with text,
  images, movies, and audio
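The split-apply-combine strategy can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the galaxy IDs, the simulated volunteers, and the choice of a simple majority vote for the combine step are hypothetical and not taken from any real project's pipeline.

```python
from collections import Counter, defaultdict

def split_tasks(items, batch_size):
    """Split: break one big problem into small batches of simple microtasks."""
    return [items[i:i + batch_size] for i in range(0, len(items), batch_size)]

def apply_batch(batch, volunteers):
    """Apply: every volunteer independently labels every item in the batch."""
    return [(item, name, label_fn(item))
            for item in batch
            for name, label_fn in volunteers.items()]

def combine_votes(responses):
    """Combine: reduce redundant labels to one label per item by majority vote."""
    votes = defaultdict(Counter)
    for item, _volunteer, label in responses:
        votes[item][label] += 1
    return {item: counts.most_common(1)[0][0] for item, counts in votes.items()}

# Hypothetical ground truth, used only to simulate volunteer answers.
TRUE_SHAPE = {"g1": "spiral", "g2": "elliptical", "g3": "spiral", "g4": "elliptical"}

# Simulated volunteers: two careful ones and one who always answers "elliptical".
volunteers = {
    "ann": lambda g: TRUE_SHAPE[g],
    "bob": lambda g: TRUE_SHAPE[g],
    "cat": lambda g: "elliptical",
}

responses = []
for batch in split_tasks(list(TRUE_SHAPE), batch_size=2):
    responses.extend(apply_batch(batch, volunteers))

consensus = combine_votes(responses)
```

In this toy run the redundancy in the apply step lets the majority vote absorb the careless volunteer's answers, which is the reason each item is shown to more than one person.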
Galaxy Zoo
Astronomers are interested in understanding the relationship between the shape and
color of galaxies

[Figure panels: (a) Elliptical, (b) Spiral]

Galaxy Zoo
Needed hand-classified galaxies, so Schawinski worked seven 12-hour days to classify
50,000 galaxies

That was only 5% of the ∼1 million galaxies in the Sloan Digital Sky Survey. A new approach
was needed . . . .
Figure 1. Main analysis page from the Galaxy Zoo web site.
Galaxy Zoo

- Volunteers had ∼5 minutes of training and passed a quiz
- They categorized as many or as few galaxies as they wished
- Much of the recruiting happened through the media
Galaxy Zoo

[Figure panels: (a) Classifications over time, (b) Classifications per user]

Galaxy Zoo

40 million classifications to consensus labels (Lintott et al., 2011)

1. Cleaning
   - Only the first classification that a volunteer made of a specific galaxy was used in the
     analysis
   - Anyone who classified more than 2 galaxies more than 5 times each had all their
     classifications discarded
2. De-biasing
   - Volunteers tended to classify faraway spiral galaxies as elliptical galaxies (Bamford et al., 2009)
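The two cleaning rules above can be sketched in a few lines. This is only an illustration under stated assumptions: the `(volunteer, galaxy, label)` tuple layout, the function name `clean`, and the sample data are hypothetical, and the thresholds simply mirror the numbers on the slide. The de-biasing step is not sketched here.

```python
from collections import Counter

def clean(classifications, max_galaxies=2, max_repeats=5):
    """Apply the two cleaning rules to (volunteer, galaxy, label) tuples in time order."""
    # How many times did each volunteer classify each galaxy?
    repeats = Counter((v, g) for v, g, _label in classifications)
    # For each volunteer, how many galaxies did they classify more than max_repeats times?
    heavy = Counter(v for (v, _g), n in repeats.items() if n > max_repeats)
    # Rule 2: volunteers over the limit lose all of their classifications.
    banned = {v for v, n in heavy.items() if n > max_galaxies}

    seen = set()
    kept = []
    for v, g, label in classifications:
        # Rule 1: keep only a volunteer's first classification of a given galaxy.
        if v in banned or (v, g) in seen:
            continue
        seen.add((v, g))
        kept.append((v, g, label))
    return kept

# Made-up data: "ann" changes her mind about g1, and "spam" hammers three galaxies.
raw = [
    ("ann", "g1", "spiral"),
    ("ann", "g1", "elliptical"),
    ("bob", "g1", "spiral"),
] + [("spam", g, "spiral") for g in ("g1", "g2", "g3") for _ in range(6)]

cleaned = clean(raw)
```

Only ann's first answer and bob's answer survive in this example; the cleaned records would then feed whatever consensus rule is used.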

Produces data comparable in quality to expert coders (Lintott et al., 2011), but at a
much greater scale
Galaxy Zoo

From millions to billions to trillions . . . .

Galaxy Zoo

Fig 5.4 (Salganik 2018), inspired by Banerji et al. (2010)
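How human labels can be magnified with supervised learning can be shown with a toy sketch: train a model on the galaxies that volunteers labeled, then let it label the rest. The assumptions are loud ones. The two numeric features and all their values are made up, and a nearest-centroid classifier is swapped in purely for brevity; it is not the model used by Banerji et al. (2010).

```python
def centroid(rows):
    """Column-wise mean of a list of equal-length feature tuples."""
    n = len(rows)
    return [sum(col) / n for col in zip(*rows)]

def train(features, labels):
    """Learn one centroid per class from the human-labeled examples."""
    by_label = {}
    for x, y in zip(features, labels):
        by_label.setdefault(y, []).append(x)
    return {y: centroid(rows) for y, rows in by_label.items()}

def predict(model, x):
    """Assign x to the class whose centroid is closest (squared Euclidean distance)."""
    def sq_dist(c):
        return sum((a - b) ** 2 for a, b in zip(x, c))
    return min(model, key=lambda y: sq_dist(model[y]))

# Hypothetical features per galaxy image, e.g. (feature_1, feature_2); values are invented.
labeled_features = [(0.9, 0.8), (0.8, 0.7), (0.2, 0.1), (0.1, 0.2)]
# Consensus labels from volunteers for those same galaxies (also invented).
consensus_labels = ["spiral", "spiral", "elliptical", "elliptical"]

model = train(labeled_features, consensus_labels)

# Galaxies no human has looked at: the model extends the human effort to them.
unlabeled_features = [(0.85, 0.75), (0.15, 0.15)]
predictions = [predict(model, x) for x in unlabeled_features]
```

The design point is the division of labor: humans supply a modest set of trusted labels, and the model applies the learned pattern to a much larger set of unlabeled items.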

https://www.zooniverse.org/
Benoit et al. (2016)
Here’s a piece of the manifesto of the Labour Party in the United Kingdom from 2010:
“Millions of people working in our public services embody the best values of
Britain, helping empower people to make the most of their own lives while
protecting them from the risks they should not have to bear on their own.
Just as we need to be bolder about the role of government in making markets
work fairly, we also need to be bold reformers of government.”
What I like about Benoit et al. (2016)
- Better, not cheaper
- Experts are a bug, not a feature
Wrapping up:
- Easy-task, big-scale problems where humans are better than computers
- Split-apply-combine strategy
- Human effort can be magnified with supervised learning
- Increasingly important as we move from numeric survey data to working with text,
  images, movies, and audio
What to read next:
- Human computation (Law and von Ahn, 2011)
- reCAPTCHA (von Ahn et al., 2008)
- Background about Amazon Mechanical Turk: Bohannon (2016)