CORE 10 Unit 2

The document outlines the principles of test construction and standardization in psychology, focusing on item analysis, reliability, validity, and the development of norms. Item analysis evaluates question performance, reliability ensures consistent test results, and validity confirms that tests measure intended constructs. Norms provide average scores for comparison, helping to interpret individual test results against typical performance standards.


Principles of test construction and standardization: item analysis, reliability, validity and development of norms


1. What is Test Construction and Standardization?

In psychology, tests are tools used to measure things like intelligence, personality, skills, or
attitudes. To make sure these tests work well, psychologists follow certain principles when
creating (constructing) and standardizing them.

• Test construction: Designing and developing the test items/questions.

• Standardization: Making the test consistent so it can be used fairly with different
people.

2. Item Analysis

Item analysis is a process used to check how well each question (item) on the test is
performing.

Why do we do item analysis?

• To find out if each question is clear and useful.

• To remove or improve questions that don’t work well.

How does it work?

Two key statistics are used:

• Item Difficulty: How hard or easy is the question?

o It’s usually the percentage of people who answer it correctly.

o For example, if 90% of people get an item right, it’s an easy question; if only
20% get it right, it’s hard.

• Item Discrimination: How well does the question differentiate between high
scorers and low scorers?

o Good questions should be answered correctly more often by people who do
well on the whole test.

o Discrimination is calculated by comparing top performers and low
performers on the test.
In summary:

• Items that are too easy, too hard, or don’t discriminate well might be dropped or
revised.
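The two statistics above can be sketched in a few lines of Python. The response matrix below is invented for illustration, and the discrimination index shown is a simple upper-lower comparison (other formulas, such as the point-biserial correlation, are also widely used):

```python
# Illustrative sketch: item difficulty and item discrimination for a small,
# invented response matrix (1 = correct, 0 = incorrect).

responses = [  # each row = one test taker, each column = one item
    [1, 1, 1, 0],
    [1, 1, 0, 0],
    [1, 0, 1, 1],
    [1, 1, 1, 1],
    [0, 0, 0, 0],
    [1, 0, 0, 0],
]

n_people = len(responses)
n_items = len(responses[0])
totals = [sum(row) for row in responses]  # each person's total score

# Item difficulty: proportion of test takers answering the item correctly.
difficulty = [sum(row[i] for row in responses) / n_people
              for i in range(n_items)]

# Item discrimination (upper-lower index): difficulty among the top half
# of total scorers minus difficulty among the bottom half.
order = sorted(range(n_people), key=lambda p: totals[p], reverse=True)
half = n_people // 2
top, bottom = order[:half], order[half:]

discrimination = [
    sum(responses[p][i] for p in top) / len(top)
    - sum(responses[p][i] for p in bottom) / len(bottom)
    for i in range(n_items)
]

print([round(d, 2) for d in difficulty])       # e.g., 0.83 = easy item
print([round(d, 2) for d in discrimination])   # higher = better separation
```

An item with difficulty near 1.0 or 0.0, or with discrimination near zero (or negative), would be a candidate for revision or removal.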

3. Reliability

Reliability means how consistent and stable the test results are.

Why is reliability important?

• If you take the test twice (or give it to similar groups), you want similar results.

• Reliable tests give consistent, trustworthy scores.

Types of Reliability:

• Test-Retest Reliability: Give the test twice to the same group at different times and
check if scores are similar.

• Internal Consistency Reliability: Check if all items on the test measure the same
thing and give consistent results (e.g., Cronbach’s alpha).

• Inter-Rater Reliability: For tests scored by judges/observers, check if
different raters give similar scores.

How to improve reliability?

• Use clear questions.

• Have enough items (longer tests are often more reliable).

• Standardize instructions and conditions.
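As a rough illustration of internal consistency, here is one way Cronbach's alpha could be computed by hand in Python; the score matrix is invented for the example:

```python
# Minimal sketch of Cronbach's alpha (internal-consistency reliability).
# Rows = people, columns = items; the ratings below are made up.
from statistics import pvariance

scores = [
    [4, 5, 4, 5],
    [2, 3, 2, 2],
    [5, 5, 4, 5],
    [3, 3, 3, 4],
    [1, 2, 2, 1],
]

k = len(scores[0])                      # number of items
totals = [sum(row) for row in scores]   # each person's total score

item_vars = [pvariance([row[i] for row in scores]) for i in range(k)]
total_var = pvariance(totals)

# Cronbach's alpha: k/(k-1) * (1 - sum of item variances / total variance)
alpha = k / (k - 1) * (1 - sum(item_vars) / total_var)
print(round(alpha, 3))
```

Values closer to 1 indicate that the items vary together, i.e., they appear to measure the same underlying thing.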

4. Validity

Validity means how well the test measures what it is supposed to measure.

Types of Validity:

• Content Validity: Does the test cover the entire topic or concept?
(Example: A math test should cover all important math skills intended to be
measured.)
• Construct Validity: Does the test really measure the psychological trait (construct)
like intelligence, anxiety, or creativity?

o Established through research and comparing test results with theory.

• Criterion-related Validity: Does the test predict real-world outcomes or behaviors?

o Predictive validity: Can test scores predict future performance?

o Concurrent validity: Does the test correlate well with other established
tests measuring the same thing?
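Criterion-related validity is typically reported as a correlation between test scores and the criterion. A minimal sketch, with invented test scores and outcomes and a hand-written Pearson correlation:

```python
# Sketch of criterion-related (predictive) validity: correlate test scores
# with a later real-world outcome. All numbers below are invented.
from statistics import mean

test_scores = [55, 60, 65, 70, 80, 85]        # e.g., an aptitude test
outcomes = [2.1, 2.5, 2.4, 3.0, 3.4, 3.6]     # e.g., later performance

def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length lists."""
    mx, my = mean(xs), mean(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = sum((x - mx) ** 2 for x in xs) ** 0.5
    sy = sum((y - my) ** 2 for y in ys) ** 0.5
    return cov / (sx * sy)

r = pearson(test_scores, outcomes)
print(round(r, 2))  # a high positive r suggests good predictive validity
```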

Why is validity important?

• A test might be reliable but invalid (consistent but measuring the wrong thing).

• Valid tests provide meaningful and useful results.

5. Development of Norms

Norms are average scores and standards developed from a large, representative sample of
people.

Why do we need norms?

• To interpret individual test scores by comparing them with the average.

• Norms tell us what is "typical" or "average" performance.

How are norms developed?

• Administer the test to a large, diverse group of people (called the normative
sample).

• Calculate average scores, ranges, percentiles, and standard deviations.

• These norms help categorize scores (e.g., above average, below average).

Types of Norms:

• Age norms: Typical scores for different age groups.

• Grade norms: Typical scores for school grades.

• Percentile ranks: What percentage scored below a particular score.
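To illustrate how norms are used in practice, here is a small sketch that interprets one raw score against an invented normative sample, via a percentile rank and a z-score:

```python
# Sketch: interpreting a raw score against a (made-up) normative sample
# using a percentile rank and a z-score.
from statistics import mean, pstdev

normative_sample = [42, 55, 61, 48, 70, 65, 58, 50, 63, 68]
raw_score = 61

# Percentile rank: percentage of the normative sample scoring below it.
below = sum(1 for s in normative_sample if s < raw_score)
percentile_rank = 100 * below / len(normative_sample)

# z-score: distance from the normative mean in standard-deviation units.
z = (raw_score - mean(normative_sample)) / pstdev(normative_sample)

print(percentile_rank)  # percentage of the sample scoring lower
print(round(z, 2))      # above-average scores give positive z
```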


Summary

• Item Analysis: check the quality of each test question; lets us remove or
fix bad questions.

• Reliability: consistency of test results; ensures dependable scores.

• Validity: the test measures what it claims; ensures meaningful and useful
results.

• Development of Norms: creating average scores from a big group; lets us
compare individual scores to typical scores.

Types of Psychological Tests: individual, group, performance, verbal, nonverbal

*You know it. Write it on your own.
