Assessment design is one of the most important responsibilities instructors take on. A well-crafted assessment gives students the chance to demonstrate genuine learning. A poorly designed one risks confusing, discouraging, or even misrepresenting what students know.
So how do we know whether an assessment is good? Three qualities stand out: validity, reliability, and alignment
Validity asks the fundamental question: Does this assessment measure what it claims to measure?
Validity forces us to match the tool to the task. An invalid assessment may produce data, but it’s data about the wrong thing.
Practical tip: Review each assessment in your course and write down the specific learning outcome it is supposed to measure. If the match feels weak, it’s time to revise.
While validity ensures we are measuring the right thing, reliability ensures we are measuring it fairly.
Reliability is about consistency: two graders using the same rubric should assign similar scores; a student retaking the assessment under the same conditions should produce similar results.
Reliability matters because without it, grades lose their meaning. Students need to trust that their work is evaluated consistently.
Practical tip: Develop rubrics with clear criteria and performance levels. Pilot test your assessment if possible, or compare scores across graders to see where discrepancies arise.
An assessment can be valid but unreliable (measuring the right skill inconsistently) or reliable but invalid (consistently measuring the wrong skill). Effective assessments require both.
For example:
Alignment connects learning goals, outcomes, objectives, and assessments into a coherent whole.
Assessments that align with these objectives ensure students are evaluated on what the course actually intends to teach.
Practical tip: Try the alignment test. For each learning objective, ask: What assessment task would directly demonstrate this? If none exist, you may need to add or revise assessments.
One of the most effective methods for ensuring validity, reliability, and alignment is backward design (Wiggins & McTighe, 2005). Instead of starting with content or activities, backward design flips the process:
Backward design prevents the all-too-common mistake of teaching first and scrambling later to “fit” assessments into the course.
To start applying these principles:
Valid, reliable, and aligned assessments are the foundation of fairness and effectiveness. But they’re not the whole story. We’ll explore how to choose the right strategies for your teaching context, because what works in a large STEM lecture might fail in a small humanities seminar and vice versa.