Test Development Process

Test Development
➢ The process of developing a test proceeds through five stages:
  → Test Conceptualization
  → Test Construction
  → Test Tryout
  → Item Analysis
  → Test Revision

Test Conceptualization
➢ "There ought to be a test designed to measure [fill in the blank] in [such and such] way."

SOME PRELIMINARY QUESTIONS
• What is the test designed to measure?
• What is the objective of the test?
• Is there a need for this test?
• Who will use this test?
• Who will take this test?
• What content will the test cover?
• How will the test be administered?
• What is the ideal format of the test?
• Should more than one form of the test be developed?
• What special training will be required of test users for administering or interpreting the test?
• What types of responses will be required of test takers?
• Who benefits from an administration of this test?
• Is there any potential for harm as the result of an administration of this test?
• How will meaning be attributed to scores on this test?

Norm-Referenced vs. Criterion-Referenced Tests: Item Development Issues
➢ Norm-Referenced Test
➢ Criterion-Referenced Test
TYPES OF SCALES:
➢ Age-Based Scale
➢ Grade-Based Scale
➢ Stanine Scale
➢ Unidimensional Scale
➢ Multidimensional Scale

PILOT WORK
➢ Pilot Work, Pilot Study, and Pilot Research

Test Construction

SCALING
➢ Scaling
  → scale values
➢ L.L. Thurstone
  → absolute scaling

SCALING METHODS:
➢ Rating Scale
  → Summative Rating Scale (scoring sketched after this list)
  → Likert Scale
➢ Method of Paired Comparison
➢ Method of Equal-Appearing Intervals
  → Objective
  → Advantage
➢ Sorting Task
  → Comparative Scale
  → Categorical Scale
➢ Guttman Scale
  → Scalogram Analysis
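The summative (Likert) rating scale listed above is scored by adding the ratings across items. A minimal sketch of that scoring, assuming hypothetical five-point agreement items and one assumed negatively worded (reverse-scored) item; the item set and responses are invented for illustration:

    # Minimal sketch of summative (Likert) scoring.
    # Assumptions: five hypothetical items rated 1-5 (strongly disagree .. strongly agree);
    # the item at index 2 is assumed to be negatively worded and is reverse-scored.
    SCALE_MIN, SCALE_MAX = 1, 5
    REVERSE_SCORED = {2}  # zero-based indices of assumed negatively worded items

    def summative_score(responses):
        """Return one respondent's total scale score (higher = more of the trait)."""
        total = 0
        for i, rating in enumerate(responses):
            if i in REVERSE_SCORED:
                rating = SCALE_MIN + SCALE_MAX - rating  # 5 -> 1, 4 -> 2, ...
            total += rating
        return total

    print(summative_score([4, 5, 2, 3, 4]))  # 4 + 5 + 4 + 3 + 4 = 20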
WRITING ITEMS
➢ What range of content should the items cover?
➢ Which of the many different types of item formats should be employed?
➢ How many items should be written in total and for each content area?
➢ Item Pool

Item Format
➢ Selected-Response Format
➢ Constructed-Response Format

Types of Selected-Response Item Format:
➢ Multiple-Choice Format
➢ Matching Item
➢ Binary-Choice Format
  → True-False Item
  → Other Variety of Binary-Choice Format
  → Disadvantage

Types of Constructed-Response Item Format:
➢ Completion Item
  → Disadvantage
➢ Short-Answer Item
➢ Essay Item
  → Drawback

Writing Items for Computer Administration
➢ Advantages
➢ Item Bank
➢ Item Branching
  → E.g., if a respondent answers an item in a way that suggests he or she is depressed, the computer might automatically probe for depression-related symptoms and behavior (a sketch of this branching logic follows this list).
➢ Computerized Adaptive Testing (CAT)
  → Advantages:
    → Floor Effect
    → Ceiling Effect
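A minimal sketch of the item-branching e.g. above; the screening threshold, rating scale, and follow-up item IDs are invented for illustration and are not from the source. Computerized adaptive testing applies the same kind of logic continuously, choosing each next item from the item bank on the basis of the testtaker's performance so far.

    # Minimal sketch of item branching for computer administration.
    # Assumption: responses are ratings 0-4, and a rating at or above the
    # threshold on the screening item triggers the depression-related probes.
    DEPRESSION_PROBES = ["dep_symptoms_01", "dep_symptoms_02", "dep_behavior_01"]

    def branch(screening_rating, threshold=3):
        """Return the follow-up items to administer next, if any."""
        if screening_rating >= threshold:
            return DEPRESSION_PROBES   # probe depression-related symptoms and behavior
        return []                      # otherwise skip the probes

    print(branch(4))  # ['dep_symptoms_01', 'dep_symptoms_02', 'dep_behavior_01']
    print(branch(1))  # []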
SCORING ITEMS
➢ Cumulative Model
➢ Class Scoring or Category Scoring
➢ Ipsative Scoring
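A brief sketch contrasting two of the scoring models named above, using made-up scores: under the cumulative model, item scores are simply summed; under ipsative scoring, a testtaker's scale scores are compared with one another (here, as deviations from that person's own mean) rather than with other testtakers. The scale names and numbers are illustrative only.

    # Minimal sketch: cumulative vs. ipsative scoring (illustrative data only).
    # Cumulative model: the higher the total of item scores, the more of the
    # measured attribute the testtaker is presumed to have.
    item_scores = [1, 0, 1, 1, 1, 0, 1]          # 1 = keyed response, 0 = not
    cumulative_score = sum(item_scores)           # -> 5

    # Ipsative scoring: scales interpreted relative to the same person's other scales.
    scale_scores = {"achievement": 18, "affiliation": 12, "autonomy": 15}
    person_mean = sum(scale_scores.values()) / len(scale_scores)          # 15.0
    ipsative_profile = {name: s - person_mean for name, s in scale_scores.items()}

    print(cumulative_score)    # 5
    print(ipsative_profile)    # {'achievement': 3.0, 'affiliation': -3.0, 'autonomy': 0.0}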
Test Tryout
➢ phantom factors (a risk associated with too small a tryout sample)

WHAT IS A GOOD ITEM?
➢ A good test item is reliable and valid, and it helps to discriminate testtakers.

Item Analysis
➢ Items may be scrutinized both quantitatively (statistically) and qualitatively.
➢ Statistical procedures include indices of:
  → the item's difficulty
  → the item's reliability
  → the item's validity
  → item discrimination

ITEM-DIFFICULTY INDEX
Item's Difficulty
➢ p: the proportion of the total number of testtakers who answered the item correctly (the subscript identifies the item, so p1 is the item-difficulty index for item 1)
  → If 50 of the 100 examinees answered item 2 correctly, then p2 = 50/100 = .50.
  → If 75 of the 100 examinees answered item 3 correctly, then p3 = 75/100 = .75; we could say that item 3 was easier than item 2.
➢ Item-Endorsement Index: the counterpart of the item-difficulty index for personality test items
➢ average p = Σp / n (the sum of the item-difficulty indices divided by the number of items)
➢ Ideal p = (chance of success proportion + 1.00) / 2
  → For a true-false item (chance = .50), ideal p = 0.75.
  → For a five-option multiple-choice item (chance = .20), ideal p = 0.6.
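A minimal sketch of the item-difficulty computations above (p, average p, and the ideal p adjusted for chance success). Items 2 and 3 use the worked numbers from the notes; the other counts are assumed for illustration.

    # Item-difficulty index: p = proportion of testtakers answering the item correctly.
    n_examinees = 100
    correct_counts = {"item1": 40, "item2": 50, "item3": 75, "item4": 62, "item5": 88}

    p = {item: count / n_examinees for item, count in correct_counts.items()}
    print(p["item2"], p["item3"])            # 0.5 0.75 -> item 3 is easier than item 2

    # Average item difficulty for the whole test: average p = (sum of p) / (number of items).
    average_p = sum(p.values()) / len(p)
    print(round(average_p, 2))               # 0.63

    # Ideal p lies midway between the chance success proportion and 1.00.
    def ideal_p(chance_success):
        return (chance_success + 1.00) / 2

    print(ideal_p(0.50))   # 0.75 (true-false item: 1 chance in 2 of guessing correctly)
    print(ideal_p(0.20))   # 0.6  (five-option multiple-choice item)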
ITEM-RELIABILITY INDEX
Item-Reliability Index
➢ provides an indication of a test's internal consistency
➢ equal to the product of the item-score standard deviation (s) and the correlation (r) between the item score and the total test score
➢ Factor Analysis: used to determine whether items on the test appear to be measuring the same thing(s)

ITEM-VALIDITY INDEX
Item-Validity Index
➢ provides an indication of the degree to which a test measures what it purports to measure; it is based on two statistics:
  → [1] The item-score standard deviation (s1), which can be calculated from the item-difficulty index (p1): s1 = √(p1(1 − p1))
  → [2] The correlation (r1C) between the item score and the criterion score
➢ Item-validity index = s1 r1C
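A minimal sketch of these computations: the item-score standard deviation is derived from the item-difficulty index, then multiplied by the relevant correlation. The p and correlation values below are assumed for illustration.

    import math

    # Item-score standard deviation from the item-difficulty index: s1 = sqrt(p1 * (1 - p1))
    def item_score_sd(p):
        return math.sqrt(p * (1 - p))

    p1 = 0.60                 # assumed item-difficulty index for item 1
    s1 = item_score_sd(p1)    # ~0.49

    r1C = 0.30                # assumed correlation between item score and criterion score
    r1T = 0.45                # assumed correlation between item score and total test score

    item_validity_index = s1 * r1C      # s1 * r1C -> ~0.15
    item_reliability_index = s1 * r1T   # s1 * r1T -> ~0.22

    print(round(s1, 2), round(item_validity_index, 2), round(item_reliability_index, 2))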
ITEM-DISCRIMINATION INDEX
Item-Discrimination Index
➢ indicates how adequately an item separates or discriminates between high scorers and low scorers on the test as a whole
  → E.g., a multiple-choice item on an achievement test is a good item if most of the high scorers answer correctly and most of the low scorers answer incorrectly.
  → An item on an achievement test is not doing its job if it is answered correctly by respondents who least understand the subject matter.
  → An item on a test purporting to measure a particular personality trait is not doing its job if responses indicate that people who score very low on the test as a whole (indicating absence or low levels of the trait in question) tend to score very high on the item (indicating that they are very high on the trait in question), contrary to what the test as a whole indicates.
➢ d: compares the performance of the upper group (U) of scorers with the performance of the lower group (L) of scorers on the item
  → d = (U − L) / n
    where U = the number of testtakers in the upper group who answered the item correctly, L = the number of testtakers in the lower group who answered the item correctly, and n = the number of testtakers in each (equally sized) group
  → d = +1.00 → all members of the upper group (U) answered the item correctly and all members of the lower group (L) answered it incorrectly
  → d = 0 → the upper and lower groups answered the item correctly in equal numbers
  → d = –1.00 → all members of the lower group (L) answered the item correctly and all members of the upper group (U) answered it incorrectly

ANALYSIS OF ITEM ALTERNATIVES.
➢ The distribution of responses of the upper group (U) and the lower group (L) across the answer alternatives of five items (∙ marks the keyed correct alternative):

                Alternatives
               ∙A    B    C    D    E
    Item 1  U  24    3    2    0    3
            L  10    5    6    6    5

                Alternatives
                A    B    C    D   ∙E
    Item 2  U   2   13    3    2   12
            L   6    7    5    7    7

                Alternatives
                A    B   ∙C    D    E
    Item 3  U   0    0   32    0    0
            L   3    2   22    2    3

                Alternatives
                A   ∙B    C    D    E
    Item 4  U   5   15    0    5    7
            L   4    5    4    4   14

                Alternatives
                A    B    C   ∙D    E
    Item 5  U  14    0    0    5   13
            L   7    0    0   16    9
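A minimal sketch applying d = (U − L) / n to the keyed alternative of each tabled item, assuming 32 testtakers in each of the upper and lower groups; the counts are taken from the tables above.

    # Item-discrimination index d = (U - L) / n for the keyed alternative of each item.
    n = 32
    # (upper-group count, lower-group count) choosing the keyed alternative:
    keyed_counts = {
        "Item 1": (24, 10),   # keyed A
        "Item 2": (12, 7),    # keyed E
        "Item 3": (32, 22),   # keyed C
        "Item 4": (15, 5),    # keyed B
        "Item 5": (5, 16),    # keyed D
    }

    for item, (upper, lower) in keyed_counts.items():
        d = (upper - lower) / n
        print(item, round(d, 2))
    # Item 1 0.44, Item 2 0.16, Item 3 0.31, Item 4 0.31, Item 5 -0.34
    # A negative d (Item 5) signals a problem: more low scorers than high scorers
    # chose the keyed answer, so the item (or its key) needs review.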
ITEM-CHARACTERISTIC CURVES
Item-Characteristic Curves
➢ a graphic representation of item difficulty and discrimination: testtaker ability is plotted on the horizontal axis and the probability of a correct (or keyed) response on the vertical axis
➢ For Discriminability Level:
  → the steeper the slope of the curve, the greater the item's discrimination
➢ For Difficulty Level:
  → the farther along the ability axis the curve rises, the more difficult the item
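The notes do not give a formula for these curves; as an illustration, a sketch of the two-parameter logistic form commonly used for item-characteristic curves, where the b parameter shifts the curve along the ability axis (difficulty) and the a parameter controls its steepness (discrimination). The parameter values are assumed.

    import math

    def icc(theta, a, b):
        """Two-parameter logistic item-characteristic curve: probability of a correct
        response at ability level theta, with discrimination a (slope) and difficulty b."""
        return 1.0 / (1.0 + math.exp(-a * (theta - b)))

    # An easier, highly discriminating item vs. a harder, flatter item (assumed parameters).
    for theta in (-2, -1, 0, 1, 2):
        print(theta, round(icc(theta, a=1.5, b=-0.5), 2), round(icc(theta, a=0.7, b=1.0), 2))
    # The first item's curve rises earlier and more steeply than the second's.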
OTHER CONSIDERATIONS IN ITEM ANALYSIS
➢ GUESSING.
➢ ITEM FAIRNESS.
➢ SPEED TESTS.

QUALITATIVE ITEM ANALYSIS
➢ Qualitative Methods
➢ Qualitative Item Analysis
➢ "Think Aloud" Test Administration
➢ Expert Panels
  → Sensitivity Review
➢ One cautionary note:

Test Revision

TEST REVISION AS A STAGE IN NEW TEST DEVELOPMENT
TEST REVISION IN THE LIFE CYCLE OF AN EXISTING TEST
➢ An existing test should be kept in its present form as long as it remains "useful," but it should be revised "when significant changes in the domain represented, or new conditions of test use and interpretation, make the test inappropriate for its intended use."
➢ Cross-Validation
  → validity shrinkage (illustrated in the sketch after this list)
➢ Co-Validation
  → co-norming
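A small simulation sketch of validity shrinkage under cross-validation: item weights fit on one sample typically predict the criterion less well in a fresh sample. All data are randomly generated for illustration, and NumPy is assumed to be available.

    import numpy as np

    rng = np.random.default_rng(0)
    n, k = 60, 10                                  # testtakers per sample, number of items
    true_w = rng.normal(size=k)

    def make_sample():
        X = rng.normal(size=(n, k))                        # item scores
        y = X @ true_w + rng.normal(scale=4.0, size=n)     # criterion with noise
        return X, y

    X_dev, y_dev = make_sample()                   # derivation (original) sample
    X_val, y_val = make_sample()                   # cross-validation sample

    w_hat, *_ = np.linalg.lstsq(X_dev, y_dev, rcond=None)  # weights fit on derivation sample

    r_dev = np.corrcoef(X_dev @ w_hat, y_dev)[0, 1]
    r_val = np.corrcoef(X_val @ w_hat, y_val)[0, 1]
    print(round(r_dev, 2), round(r_val, 2))        # validity is typically lower on the new sample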
THE USE OF IRT IN BUILDING AND REVISING TESTS
➢ [1] Evaluating the properties of existing tests and guiding test revision.
➢ [2] Determining measurement equivalence across testtaker populations.
  → Differential Item Functioning (DIF)
  → DIF Analysis (a rough screening sketch follows this list)
  → DIF Items
➢ [3] Developing item banks.
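The notes name DIF analysis without detail; as a rough, simplified screen (not the Mantel-Haenszel or IRT-based procedures used in practice), one can match reference- and focal-group testtakers on total score and compare how they perform on the studied item within matched score bands. The records and score bands below are invented for illustration.

    # Rough DIF screen: within each total-score band, compare the proportion of
    # reference-group vs. focal-group testtakers who answered the studied item correctly.
    def dif_screen(records, bands):
        """records: list of (group, total_score, item_correct) tuples, group is 'ref' or 'focal'.
        bands: list of (low, high) total-score ranges used to match the two groups.
        Returns the average within-band difference in proportion correct (ref minus focal)."""
        diffs = []
        for low, high in bands:
            in_band = [r for r in records if low <= r[1] <= high]
            ref = [c for g, _, c in in_band if g == "ref"]
            focal = [c for g, _, c in in_band if g == "focal"]
            if ref and focal:
                diffs.append(sum(ref) / len(ref) - sum(focal) / len(focal))
        return sum(diffs) / len(diffs) if diffs else 0.0

    records = [
        ("ref", 12, 1), ("ref", 13, 1), ("ref", 20, 1), ("ref", 21, 1),
        ("focal", 12, 0), ("focal", 14, 1), ("focal", 20, 0), ("focal", 22, 1),
    ]
    print(dif_screen(records, bands=[(10, 15), (16, 25)]))
    # 0.5 -> even when matched on total score, the reference group does better on
    # this item, which flags it for closer DIF analysis.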