Teacher assessments during compulsory education are as reliable, stable and heritable as standardized test scores

J Child Psychol Psychiatry. 2019 Dec;60(12):1278-1288. doi: 10.1111/jcpp.13070. Epub 2019 May 12.

Abstract

Background: Children in the UK go through rigorous teacher assessments and standardized exams throughout compulsory (elementary and secondary) education, culminating with the GCSE exams (General Certificate of Secondary Education) at the age of 16 and A-level exams (Advanced Certificate of Secondary Education) at the age of 18. These exams are a major tipping point directing young individuals towards different lifelong trajectories. However, little is known about the associations between teacher assessments and exam performance or how well these two measurement approaches predict educational outcomes at the end of compulsory education and beyond.

Methods: The current investigation used the UK-representative Twins Early Development Study (TEDS) sample of over 5,000 twin pairs studied longitudinally from childhood to young adulthood (age 7-18). We used teacher assessment and exam performance across development to investigate, using genetically sensitive designs, the associations between teacher assessment and standardized exam scores, as well as teacher assessments' prediction of exam scores at ages 16 and 18, and university enrolment.

Results: Teacher assessments of achievement are as reliable, stable and heritable (~60%) as test scores at every stage of the educational experience. Teacher and test scores correlate strongly phenotypically (r ~ .70) and genetically (genetic correlation ~.80) both contemporaneously and over time. Earlier exam performance accounts for additional variance in standardized exam results (~10%) at age 16, when controlling for teacher assessments. However, exam performance explains less additional variance in later academic success, ~5% for exam grades at 18, and ~3% for university entry, when controlling for teacher assessments. Teacher assessments also predict additional variance in later exam performance and university enrolment, when controlling for previous exam scores.

Conclusions: Teachers can reliably and validly monitor students' progress, abilities and inclinations. High-stakes exams may shift educational experience away from learning towards exam performance. For these reasons, we suggest that teacher assessments could replace some, or all, high-stakes exams.

Keywords: Educational achievement; quantitative genetics; standardized exams; teacher assessment; twin models.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't
  • Twin Study

MeSH terms

  • Adolescent
  • Child
  • Educational Measurement / standards*
  • Educational Status*
  • Female
  • Humans
  • Longitudinal Studies
  • Male
  • School Teachers / standards*
  • Schools*
  • Students*
  • United Kingdom