Assess 1 Module
Assess 1 Module
COLLEGE OF TEACHER
EDUCATION
Prepared by:
Name
Sched code/class
schedule
Address
INTRODUCTION
In this Chapter we are concerned with developing paper-and- paper tests for
assessing the attainment of educational objectives based on Bloom's taxonomy. Paper-
and-pencil test can either be selected response or constructed-response types. Selected
response type includes (a) true-false items, (b) multiple-choice type items and (c)
matching type. Constructed-response type of test includes (a) enumeration, (b)
completion and (c) essays.
The construction of valid test items begins with a Table of Specifications.
Deciding on the type of objective test. The test objectives guide the kind of objective
tests that will be designed and constructed by the teacher. This means aligning the test
with the lesson objective/outcome.
For instance, for the first four (4) levels, we may want to construct a multiple-choice type
of test while for application and judgment, we may opt to give an essay test or a modified
essay test. At all times, the test to be formulated must be aligned with the learning
outcome. This is the principle of constructive alignment.
The simplest TOS consists of four (4) columns: (a) level of objective to be tested, (b)
statement of objective, © item numbers where such an objective is being tested and (d)
number of items and percentage out of the total for that particular objective. A prototype
table is shown below:
Constructing the test items. The actual construction of the test items follows the TOS.
As a general rule, it is advised that the actual number of items to be constructed in the
draft should be double the desired number of items.
For instance, if there are five (5) recall level items to be included in the final test form,
then at least ten (10) recall level items should be included in the draft The subsequent
test try-out and item analysis will most likely eliminate many of the constructed items in
the draft (either they are too difficult, too easy or non-discriminatory), hence it will be
necessary to construct more items than will actually be included in the final test form.
Most often, however, the try-out is not done due to lack of time.
Item analysis and try-out. The test draft is tried out to a group of pupils or students.
The purpose of this try-out is to determine the: (a) item characteristics through item
analysis, and (b) characteristics of the test itself-validity, reliability and practicality.
A student who knows nothing of the content of the examination would have
50% chance of getting the correct answer by sheer guess work. Although correction-for-
guessing formulas exist, it is best that the teacher ensures that a true-false item is able to
discriminate properly between those who know and those who are just guessing. A
modified true-false test can offset the effect of guessing by requiring students to explain
their answer and to disregard a correct answer if the explanation is incorrect. Here are
some rules of thumb in constructing true-false items.
Rule 2. Avoid using the words "always," "never, "often" and other
words that tend to be either always true or always false.
Example: Tests need to be valid, reliable and useful, although, it would require
a great amount of time and effort to ensure that tests possess these test characteristics.
.
Notice that the statement is true. However, we are also not sure which part of
the sentence is deemed true by the student. It is just fortunate that in this case, all parts
of the sentence are true and, hence, the entire sentence is true. The following example
illustrates what can go wrong in long sentences:
Example: Tests need to be valid, reliable and useful since it takes very little
amount of time, money and effort to construct tests with these characteristics. .
The first part of the sentence is true but the second part is debatable and may,
in fact, be false. Thus, a true" response is correct and, also, a "False" response is correct.
1. T 6. 6. F 1. 1. T 6. 6. F
2. F 7. 7. F 2.2. F 7. 7. T
3. F 8. 8. F OR 3. 3. T 8. 8. F
4. F 9. 9. F 4. 4. F 9. 9. T
5. F 1 10. F 5. 5. T 1 10. F
For ease of correction, teachers sometime create a pattern of True or False answers.
Students will sense it and may arrive at a correct answer not because he/she really knows
the answer but because he/she senses the pattern.
This makes test item unclear and definitely will confuse the student. e.g. The changes that
take place in early childhood are NOT Unchangeable. The test item simply means "The
changes in early childhood are changeable".
The distracters are chosen in such a way that they are attractive to those who
do not know the answer or who are guessing but at the same time, have no appeal to
those who actually know the answer. It is this feature of multiple choice type tests that
allows the teacher to test higher order thinking skills even if the options are clearly stated.
As in true false items, there are certain rules of thumb to be followed in constructing
multiple choice tests.
The ability of the item to discriminate or its level of difficulty should stem from
the subject matter rather than from the wording of the question.
2) Do not use modifiers that are vague and whose meanings can differ from one
person to the next such as: much, often, uxually, etc.
The qualifier "much" is vague and could have been replaced by more specific qualifiers
like: "90% of the photosynthetic process" or some similar phrase that would be more
precise. Be quantitative.
Example:
(Better) Who was the President of the Philippines after Corazon C. Aquino?
Example:
(Poor) Which of the following will not cause inflation in the Philippine economy?
(Better) Which of the following will cause inflation in the Philippine economy?
Poor: What does the statement "Development patterns. acquired during the
formative years are NOT Unchangeable" imply? A. B. C D Better: What does the statement
"Development patterns acquired during the formative years are changeable" imply?
A.
B.
C.
D.
5) Each item stem should be as short as possible; otherwise you risk testing
more for reading and comprehension skills.
The short story: May Day's Eve, was written by which Filipino author?
If distracters had all been Filipino authors, the value of the item would be
greatly increased. In this particular instance, only the first three carry the burden of the
entire item since the last two can be essentially disregarded by the students.
7) All multiple choice options should be grammatically consistent with the stem.
Example:
Option A, B and C are obviously wrong for the language smart because when added to the
stem the sentence is grammatically wrong. D is the only option which when connected to
the stem retains the grammatical accuracy of the sentence, thus obviously is the correct
answer.
Example:
a. Who will mosť strongly disagree with the progressivist who claims that the child should
be taught only that which interests him and if he is not interested, wait till the child gets
interested?
A. Essentialist C. Progressivist
B. Empiricist D. Rationalist
One may arrive at a correct answer (letter b) by looking at item "a," that gives the
answer to "b."
10) Avoid alternatives that are synonymous with others or those that include or
overlap others.
Example:
What causes ice to transform from solid state to liquid state?
a. Change in temperature
b. Changes in pressure c
. Change in the chemical composition
d. Change in heat levels
The options "a" and "d" are essentially the same. Thus, a student who spots
these identical choices would right away narrow down the field of choices to a, b, and c.
The last distracter would play no significant role in increasing the value of the item.
If this happens then the item has two answers, which is not acceptable.
11) Avoid presenting sequenced items in the same order as in the text.
12) Avoid use of assumed qualifiers that many examinees may not be aware of.
13) Avoid use of unnecessary words or phrases which are not relevant to the
problem at hand (unless such discriminating ability is the primary intent of the
evaluation). The item's value is particularly damaged if the unnecessary material is
designed to distract or mislead. Such items test the student's reading comprehension
rather than knowledge of the subject matter.
Example:
The side opposite the thirty degree angle in a right triangle is equal to half the length of
the hypotenuse. If the sine of a 30-degree is 0.5 and its hypotenuse is 5, what is the
length of the side opposite the 30-degree angle?
a. 2.5
b. 3.5
c. 5.5
d. 1.5
The sine of a 30-degree angle is really quite unnecessary since the first sentence already
gives the method for finding the length of the side opposite the thirty-degree angle. This is
a case of a teacher who wants to make sure that no student in his class gets the wrong
answer!
Note in the previous example, knowledge of the sine of the 30-degree angle
would have led some students to use the sine formula for calculation even if a simpler
approach would have sufficed.
15) Pack the question in the stem. Here is an example of a question which has
no question. Avoid it by all means.
Example:
с. had no heroes
16) Use the "None of the above" option only when the keyed answer is totally
correct. When choice of the "best" response is intended, "none of the above" is not
appropriate, since the implication has already been made that the correct response may
be partially inaccurate.
17) Note that use of "all of the above" may allow credit for partial knowledge. In
a multiple option item, (allowing only one option choice) if a student only knew that two
(2) options were correct, he could then deduce the correctness of "all of the above." This
assumes you are allowed only one correct choice.
18) Better still use "none of the above" and "all of the above" sparingly but best
not to use them at all.
Example:
a. Southeast Asia
b. Eastern Europe
c. South America
d. East Africa
e. Central America
The matching type items may be considered modified multiple choice type items where
the choices progressively reduce as one successfully matches the items on the left with
the items on the right.
Here are some guidelines to observe in the formulation of good matching type of test.
1. Match homogeneous not heterogeneous items.
The items to match must be homogeneous. If you want your students to match
authors with their literary works, in one column will be authors and in the second column
must be literary works. Don't insert nationality for instance with names of authors. That
will not be a good item since it is obviously wrong.
Example of homogeneous items. The items are all about the Filipino heroes, nothing
more. Match the items in Column A with the items in Column B.
2. The stem (longer in construction than the options) must be in the first column
while the options (usually shorter) must be in the second column.
3. The options must be more in number than the stems to prevent the student
from arriving at the answer by mere process of elimination.
4. To help the examinee find the answer easier, arrange the options
alphabetically or chronologically, whichever is applicable.
5. Like any other test, the direction of the test must be given. The examinees
must know exactly what to do.
Matching type items, unfortunately, often test lower order thinking skills (knowledge level)
and are unable to test higher order thinking skills such as application and judgement
skills. Another type of a matching type of test is the imperfect type.
Column A Column B
1. The Alhambra A. Cooper
2. The Pioneers B. Dana
3. The Guardian Angel C. Emerson
4. Two Years Before the Mast D. Holmes
5. Moby Dick E. Irving
6. The World in a Man of War F. James
7. The Last of the Mohicans G. Melville
8. The American Scholar H. Mark Twains (Clemens)
9. The Autocrat of the Breakfast Table I. Wharton
10. Tom Sawyer
If you intend to make use of this imperfect type of matching test, make sure you indicate
so in the "Direction" to caution the students who usually think that an answer may not be
repeated.
Supply type tests depend heavily on the way the stems are constructed. These
tests allow for one and only one answer and, hence, often test only the students' recall of
knowledge.
The appropriate synonym for the first is CHANGE with six (6) letters while the
appropriate synonym for the second is GROW with four (4) letters. Notice that these
questions require not only mere recall of words but also understanding of these words.
Another example of a completion type of text that measures higher order thinking
skill is given below:
A B
1. Square root of 9 a. -3
2. Square of 25 b. 615
3. 36 inches c. 3 meters
4. 4 feet d. 48 inches
5. 1 kilogram e. I pound
1. Avoid over mutilated sentences like this test item. Give enough clue to the
student.
The produced by the to change the process is called and is used by the
green to change the and into . This process is called
.
2. Avoid open-ended item. There should be only one acceptable answer. This item is
open-ended, hence no good test item.
3. The blank should be at the end or near the end of the sentence. The question must first
be asked before an answer is expected. Like the matching type of test, the stem (where
the question is packed) must be in the first column.
There are other more significant items to ask other than specific birthdates.
5. The length of the blanks must not suggest the answer. So better to make the blanks
uniform in size.
5.4.2 Essays
Essays, classified as non-objective tests, allow for the assessment of higher order
thinking skills. Such tests require students to organize their thoughts on a subject matter
in coherent sentences in order to inform an audience. In essay tests, students are required
1. Comparing
3. Justifying
4. Summarizing
State the points included in …
Briefly summarize the contents of 5. Generalizing
6. Inferring
In the light of the facts presented, what is most likely to happen when...
How would Senator X be most likely to react to the bomb explosion after the bar
examination last September?
7. Classifying
Group the following items according to…
What do the following items have in common?
8. Applying
Using the principles of as guide, describe how you would solve the following
problem situation. - Describe a situation that illustrates the principle of .
9. Analyzing
10. Evaluating
Describe the strengths and weaknesses of the following…
Using the criteria developed in class, write an evaluation of ...
11. Creating
Examples are asking students to "write an example," "list three reasons" or "compare and
contrast two techniques."
Sample Short Response Question
(10th Grade Reading)
How are the scrub jay and the mockingbird
different? Support your answer with details and
information from the article.
Non-restricted /Extended Essay- Extended responses can be much longer and complex
than short responses, but students are encouraged to remain focused and organized.
Sample Extended Response Question
(5th Grade Science)
Robert is designing a demonstration to display at his school's science fair. He will show
how changing the position of a fulcrum on a lever changes the amount of force needed
to lift an object. To do this, Robert will use a piece of wood for a lever and a block of
wood to act as a fulcrum. He plans to move the fulcrum to different places on the lever
to see how its placement affects the force needed to lift an object.
Part - A Identify at least two other actions that would make Robert's demonstration
better.
Part B Explain why each action would improve the demonstration.
(Source: https://fcit.usf.edu/assessment/constructed/constructb.html)
Note that all these involve the higher-level skills mentioned Bloom's Taxonomy.
Rule 1: Phrase the direction in such a way that students are guided on the
key concepts to be included. Specify how the students should respond.
Example: Using details and information from the article (Hundred Islands), summarize
the main points of the article. For a complete and correct response, consider these
points:
• its history (10 pts)
• its interesting features (10 pts)
• why it is a landmark (5 pts)
Non-example: Using details and information from the article (Hundred Islands)
summarize the main points of the article.
Source: https:/fcit.usf.edu/assessment/constructed/constructb.html
Rule 2: Inform the students on the criteria to be used for grading their
essays. This rule allows the students to focus on relevant and substantive materials
rather than on peripheral and unnecessary facts and bits of information.
Rule 4: Decide on your essay grading system prior to getting the essays of
your students.
Rule 6: Evaluate answers to essay questions without knowing the identity of the
writer.
This is another attempt to control personal bias during scoring. Answers to
essay questions should be evaluated in terms of what is written, not in terms of what is
known about the writers from other contacts with them. The best way to prevent our prior
knowledge from influencing our judgment is to evaluate each answer without knowing the
identity of the writer. This can be done by having the students write their names on the
back of the paper or by using code numbers in place of names.
Rule 7: Whenever possible, have two or more persons grade each answer.
The best way to check on the reliability of the scoring of essay answers is to
obtain two or more independent judgments. Although this may not be a feasible practice
for routine classroom testing, it might be done periodically with a fellow teacher (one who
is equally competent in the area). Obtaining two or more independent ratings becomes
especially vital where the results are to be used for important and irreversible decisions,
such as in the selection of students for further training or for special awards Here the
pooled ratings of several competent persons may be needed to attain level of reliability
that is commensurate with the significance of the decision being made.
Some teachers use the cumulative criteria i.e. adding the weights given to
each criterion, as basis for grading while others use the reverse. In the latter method,
each student begins with a score of 100. Points are then deducted every time a
teacher encounters a mistake or when a criterion is missed by the student in his
essay. Rule 8: Do not provide optional questions. It is difficult to construct questions
of equal difficulty and so teacher cannot have valid comparison of students'
achievement. Rule 9: Provide information about the value/weight of the question and
how it will be scored. Rule 10: Emphasize higher level thinking skills.
Non Example:
Considering the influence of ocean temperatures, explain why inland temperatures
vary in summer and winter to a greater degree than coastal temperatures. List three
coastal land masses.
MENTAL EXERCISES:
Objective:
Construct a Table of Specifications (ToS) for the selected grading period. The ToS will
align the test content with the learning competencies outlined in the K-12 English
Curriculum for a specific grade level.
Directions:
1. Organize students into four groups (A, B, C, D). Each group will do the
task.
2. Select a Grade Level and Quarter:
Choose one grade level between Grades 7 to 10, and select the quarter
(e.g., 1st, 2nd, 3rd, or 4th quarter) for which you will create the test. The
test must be based on the competencies of that quarter as outlined in the K
to 12 Curriculum Guide for English.
3. Review the Curriculum Guide:
Refer to the K-12 English Curriculum Guide for the selected grade level and
quarter. Identify the learning competencies, content standards, and
Objective:
To collaboratively construct and refine multiple-choice and matching-type test items
that are aligned with the Table of Specifications (TOS), covering various learning
objectives and cognitive levels.
Instructions:
Phase 1: Individual Construction Based on the TOS
1. Each student constructs two test items:
o One multiple-choice item:
The item should align with a specific learning objective and
cognitive level as outlined in the TOS.
Include a clear stem and four answer choices: one correct
answer and three distractors.