A LASTING IMPACT High-Stakes Teacher Evaluations Drive Student Success in Washington, D.C

research A LASTING IMPACT High-stakes teacher evaluations drive student success in Washington, D.C TEACHERS MATTER—and some matter more than others. That recognition has driven a tidal wave of controversial policy reforms over the past decade, rooted in new evaluation systems that link teachers’ ratings and, in some cases, their pay and advancement to evidence of classroom practice and student learning. Two out of three U.S. states overhauled teacher evaluations between 2009 and 2015, supported by federal incentives such as Race to the Top and Teacher Incentive Fund grants, as well as No Child Left Behind Act waivers. What is the impact, so far, of these reforms? One common narrative would indicate a flop. Most states adopting new evaluation systems saw little change in the share of teachers deemed less than effective, argu- ably limiting their potential to address underperformance. Meanwhile, a broader backlash against reform, fueled by concerns about over- reliance on standardized tests, the accuracy of new evaluations, and the efficacy of performance-based incentives, has led some states to reverse course. Congress ultimately chose to exclude any requirements about teacher evaluation policies from the Every Student Succeeds Act of 2015, dashing some reformers’ hopes for a federal mandate. A closer look at one high-stakes evaluation system, however, shows the positive consequences such systems can have for students. Since 2012, we have been studying IMPACT, a seminal effort by the District of Columbia Public Schools (DCPS) to link teacher retention and pay to their performance. Under IMPACT, the district sets detailed standards for high-quality instruction, conducts multiple observations, assesses individual performance based on evidence of student progress, and retains and rewards teachers based on annual ratings. Looking across our analyses, we see that under IMPACT, DCPS has dramatically improved the quality of teaching in its schools—likely contributing to its status as the fastest-improving large urban school system in the United States as measured by the National Assessment of Educational Progress. Such reforms are often considered politically impossible, and the effort in DCPS shows the potential fallout. The district’s controversial chancellor, Michelle Rhee, resigned a year after IMPACT launched, when the mayor who appointed her, Adrian Fenty, lost a reelection by THOMAS DEE and JAMES WYCKOFF 58 EDUCATION NEXT / FALL 2017 educationnext.org Kaya Henderson was Chancellor of the District of Columbia Public Schools from 2010–2016. WASHINGTON POST WASHINGTON VOISIN,THE SARAH L. SARAH / H PHOTOGRAP educationnext.org FALL 2017 / EDUCATION NEXT 59 bid in a campaign focused on school reform. and are intended to drive improvements in But IMPACT outlasted them both, to teacher quality and student achievement the benefit of students. DCPS dismissed the (see “Capturing the Dimensions of Effective majority of very low performing teachers Teaching,” features, Fall 2012). and replaced them with teachers whose stu- Under IMPACT, all teachers receive a dents did better, especially in math. Other single score ranging from 100 to 400 points low-performing teachers were 50 percent at the end of each school year based on more likely to leave their jobs voluntarily, classroom observations, measures of student and those who opted to stay improved learning, and commitment to the school significantly, on average, the following community. However, the components year. High-performing teachers improved of the scores and the weights assigned to their performance as well, especially those them vary, depending on what measures within reach of the significant financial of student performance are available, and incentive created by the system. Certainly, DCPS has adjusted their composition over improvement was not universal, and some time. Below, we describe the components of very good teachers decided to leave the dis- IMPACT as it existed in its first three years. trict. Nonetheless, our analysis finds that All teachers were evaluated by five improved teaching was common and that All teachers receive structured classroom observations aligned student achievement increased as a result. a single IMPACT to the district’s Teaching and Learning The DCPS story shows that it may be Framework, which defined domains of politically challenging to adopt high-stakes score ranging from effective instruction, such as leading well- evaluation systems, but it is not impossible. 100 to 400 points organized, objective-driven lessons; check- And it shows that well-designed and care- ing for student understanding; explaining fully implemented teacher evaluations can at the end of each content clearly; and maximizing instruc- serve as an important district improvement school year based tional time. Of the five observations, three strategy—so long as states and districts are were conducted by an administrator and two also willing to make tough, performance- on classroom by “master educators” who traveled between based decisions about teacher retention, observations, several schools; only the first observation development, and pay. was announced in advance. measures of student The weight of these observation scores learning, and varied, depending on whether a teacher also A More-Rigorous Approach had a value-added score based on her stu- Teacher evaluations came into focus in commitment to the dents’ performance on standardized tests. 2009 alongside a growing research consen- school community. For those teachers—who led reading or math sus that teacher quality has dramatic, long- classrooms in grades 4‒8 and accounted for lasting impacts on student success. DCPS less than one in five DCPS teachers—observa- was in the vanguard of these efforts, and was tions were worth 35 percent and value-added one of the first school districts in the country was worth 50 percent. For the majority of to link individual performance evaluations DCPS teachers who did not have value-added to high-stakes decisions about pay and scores, observations counted for 75 percent retention when it launched IMPACT in the of the total IMPACT score. An additional 2009‒10 school year. 10 percent was based on progress toward IMPACT articulates clear standards for goals for student learning, which teachers effective instruction, provides instructional and administrators set each year. coaches to help teachers meet those standards, A teacher’s contribution to a school’s and evaluates teacher performance based in community, as assessed by the principal, large part on direct, structured observations was worth 10 percent of the overall evalu- of teachers’ classroom practices in addition ation score, while the final 5 percent was to evidence of student learning. IMPACT’s based on a measure of the value-added features are broadly consistent with emerging to student achievement for the school as best-practice design principles informed by a whole. Teachers could also have points the Measures of Effective Teaching project, deducted for failing to meet expectations 60 EDUCATION NEXT / FALL 2017 educationnext.org research TEACHER EVALUATIONS DEE & WYCKOFF for attendance, punctuality, and adherence years of IMPACT: 2009‒10, 2010‒11, and to other policies and procedures. 2011‒12. We limited our sample to general- education teachers working in schools serving K‒12 students. We also drew on an additional Telling Teachers Apart Unlike typical year of data, from the 2012‒13 school year, Unlike typical teacher-evaluation systems, teacher-evaluation in assessing IMPACT’s effects on student IMPACT creates substantial differentiation in achievement in tested grades and subjects. ratings. In 2009‒10 and 2010‒11, 14 percent of systems, IMPACT teachers were rated “highly effective,” 69 per- creates substantial cent of teachers were rated “effective,” 14 per- Teacher Retention cent were judged “minimally effective,” and differentiation and Performance another 2 percent were deemed “ineffective.” in ratings. DCPS teachers were retained differently The system also sets swift consequences depending on their IMPACT ratings. By the for teachers with very low or very high scores. program’s third year, 95 percent of teach- Teachers rated “ineffective” are dismissed ers deemed “ineffective” in the system’s first with rare exception, a rule that caused more two years had been dismissed (see Figure than 200 teachers to exit DCPS in IMPACT’s 1). Overall, 3.8 percent of all teachers in the first year. Those rated “minimally effective” district were let go as a result of being rated are given one year to raise their score above “ineffective” once or after earning two con- the “effective” threshold; if they do not, they secutive “minimally effective” ratings under are dismissed. IMPACT between 2009‒10 and 2011‒12. On the other side of the spectrum, teach- Under IMPACT, lower-rated teachers were ers are eligible for a bonus of up to $25,000 in each year they are rated “highly effective;” those Differential Teacher Retention Under IMPACT (Figure 1) with “highly effective” scores two or more years in a row can earn Ninety-five percent of teachers identified as “ineffective” under IMPACT increases in their base pay of up to were dismissed. Among those identified as “minimally effective,” 14 percent $27,000. Teachers working in high- were dismissed and 27 percent voluntarily left DCPS. Among “effective” and poverty schools teaching high-need “highly effective” teachers, the majority stayed at DCPS. subjects are eligible for the largest

A LASTING IMPACT High-Stakes Teacher Evaluations Drive Student Success in Washington, D.C

The Association for Education Finance and Policy 41St Annual Conference March 17-19, 2016 Denver Marriott City Center, Denver, Colorado

Navigating the Job Market

School Turnarounds: Evidence from the 2009 Stimulus

Nber Working Paper Series the Impact of No Child Left

Student-Teacher Race Match in Charter and Traditional Public Schools

Incentives, Selection, and Teacher Performance: Evidence from Impact

Seventy -Fifth Annual Commencement June 13) 1969

Were All Those Standardized Tests for Nothing? the Lessons of No Child Left Behind

Jonah E. Rockoff [email protected] Graduate School of Business, Columbia University Tel

The Effects of Accountability Incentives in Early Childhood Education

View, Download, and Print IES Presentations at AEFP As a PDF File

The Federal Role in Improving Educational Productivity