TNE


 

TNE Virtual Library
Design Principle A:

Decisions Driven by Evidence
1.    Drawing Upon Research
2.    The Role of Pupil Learning


High-Stakes Testing, Uncertainty, and Student Learning

Audrey L. Amrein, David C. Berliner
Education Policy Analysis Archive (Vol. 10, No. 18)
March 28, 2002

“A brief history of high-stakes testing is followed by an analysis of eighteen states with severe consequences attached to their testing programs. These 18 states were examined to see if their high-stakes testing programs were affecting student learning, the intended outcome of high-stakes testing policies promoted throughout the nation…. While a state's high-stakes test may show increased scores, there is little support in these data that such increases are anything but the result of test preparation and/or the exclusion of students from the testing process.… Because clear evidence for increased student learning is not found, and because there are numerous reports of unintended consequences associated with high-stakes testing policies (increased drop-out rates, teachers' and schools' cheating on exams, teachers' defection from the profession, all predicted by the uncertainty principle), it is concluded that there is need for debate and transformation of current high-stakes testing policies.”

Sizing Up Test Scores
Dale Ballou
Education Next (Vol. 2, No. 2)
Summer 2002

“One of the basic critiques of using test scores for accountability purposes has always been that simple averages, except in rare circumstances, don’t tell us much about the quality of a given school or teacher…. The more serious difficulties arise when value-added assessments are used to hold schools and teachers accountable, with high-stakes personnel decisions to follow. The danger is that such assessments will be used to supplant local decisionmaking, rather than to inform it.”

Measuring the Impacts of Whole-School Reforms: Methodological Lessons from an Evaluation of Accelerated Schools
Howard S. Bloom
Manpower Demonstration Research Corporation
U.S. Department of Education, Planning and Evaluation Service
October 2001

“The goal of the present paper is to introduce education researchers to a new approach for measuring the impacts of whole-school reforms. The approach is based on interrupted time-series analysis, which has been used to evaluate programs in many fields but has not been used widely to study education initiatives. The application presented measures program impacts on student performance by comparing standardized test scores for a number of annual student cohorts in a specific grade after a reform is launched (its follow-up period) with the scores of cohorts from several years before the reform was launched (its baseline period). The approach is used to measure impacts on three facets of student performance: (1) average (mean) test scores, which summarize impacts on total performance; (2) the distribution of scores across specific ranges, which helps to identify where in the distribution of student performance impacts were experienced; and (3) the variation (standard deviation) of scores, which indicates how the disparity in student performance was affected. To help researchers use the approach, the paper lays out its conceptual rationale, describes its statistical procedures, explains how to interpret its findings, indicates its strengths and limitations and illustrates how it was used to evaluate a major whole-school reform model—Accelerated Schools.”

The Virtues of Randomness:
Education Discovers Randomized Field Trials

Robert Boruch
Education Next (Vol. 2, No. 3)
Fall 2002

"Randomized trials will only become more frequent in the education arena as time goes on. Today's reform environment demands hard evidence on which programs and policies are effective at raising student achievement. So far, randomized trials provide the best way of obtaining such evidence, and their techniques become more sophisticated by the day. The question is whether these results will be used to effect change or be held hostage to the ideological posturing that so often substitutes for evidence in the education world."

The Full Circle: Building a Coherent Teacher Preparation System.
The Report of the NASBE Study Group on Coordination and Accountability in Teacher Education

Carla Claycomb, Jeanne Pecori
National Association of State Boards of Education
October 2000

“More and more, teachers, policymakers, parents, and others are realizing that student achievement need not be prescribed by socioeconomic status, parent involvement, or race and ethnicity; on the contrary, recent evidence makes clear that regardless of the factors that students bring to school, good teachers measurably increase student learning, and good schools foster high levels of student achievement in large part because of the quality of their teachers. As a matter of fact, teacher quality may be one of the most significant factors in student achievement.”

Measuring What Matters: An Update on Educational Assessment and Accountability
Committee for Economic Development
August 2002

"In this 11-page brief, the Committee for Economic Development (CED) examines the No Child Left Behind Act's (NCLB) key assessment and accountability provisions and issues surrounding their implementation. The report is a follow-up to a CED tract (Measuring What Matters: Using Assessment and Accountability to Improve Student Learning) that was released early last year - while NCLB was still being tossed around in Congress and standards and testing systems were more matters of state than federal policy. If NCLB is to improve student learning, the new report says, it must meet a trio of challenges (all pretty obvious yet critical): 1) test results must measure and report student achievement accurately; 2) educators must teach solid content, not just 'teach to the test'; and 3) low-performing schools must be given extra assistance. Following an explanation of each challenge is a list of related issues to be monitored and questions that policymakers ought to ask when designing and reforming standards and accountability systems."

Defining "Highly Qualified Teachers": What Does "Scientifically Based Research" Actually Tell Us?
Linda Darling-Hammond, Peter Youngs
Educational Researcher (Vol. 31, No. 9)
December 2002

"In this report titled 'Meeting the Highly Qualified Teachers Challenge' the Secretary essentially argues for the dismantling of teacher education systems and the redefinition of teacher qualifications to include little preparation for teaching. . . The report suggests that its recommendations are based on 'solid research.' However, none of these arguments has strong empirical support, and the report does not cite the scientific literature that addresses them."

Teacher Quality and Student Achievement: A Review of State Policy Evidence
Linda Darling-Hammond
Education Policy Analysis Archives (Vol. 8, No. 1)
January 1, 2000

“Using data from a 50-state survey of policies, state case study analyses, the 1993-94 Schools and Staffing Surveys (SASS), and the National Assessment of Educational Progress (NAEP), this study examines the ways in which teacher qualifications and other school inputs are related to student achievement across states. The findings of both the qualitative and quantitative analyses suggest that policy investments in the quality of teachers may be related to improvements in student performance. Quantitative analyses indicate that measures of teacher preparation and certification are by far the strongest correlates of student achievement in reading and mathematics, both before and after controlling for student poverty and language status. State policy surveys and case study data are used to evaluate policies that influence the overall level of teacher qualifications within and across states. This analysis suggests that policies adopted by states regarding teacher education, licensing, hiring, and professional development may make an important difference in the qualifications and capacities that teachers bring to their work. The implications for state efforts to enhance quality and equity in public education are discussed.”

Legislating equity: The distribution of emergency permit teachers in California
Laura Goe
Education Policy Analysis Archives (Vol. 10, No. 42)
October 14, 2002

“There is a significant negative relationship between the percentage of teachers on emergency permits and student achievement at the school level in California schools, after controlling for other student and school characteristics. Generally, the more emergency permit teachers there are in a school, the lower the school's achievement.  This phenomenon is examined in the context of other contributors to student achievement such as socio-economic status and school size.  The effects of teacher distribution and school selection as contributing factors are considered.   In addition, policy and legislative initiatives related to emergency permit teachers that have been recently debated in California will be discussed. Finally, a set of initiatives is proposed that attempt to decrease the need for emergency permit teachers and ensure that those that must be hired due to shortage conditions have the support they need to become credentialed teachers.”

The Business Model
Jay P. Greene
Education Next (Vol. 2, No. 2)
Summer 2002

“Like the makers of hot dogs, psychometricians, economists, and other testing experts know too well what goes into the creation of achievement tests. Their intimate knowledge of the technical difficulties involved in measuring student achievement makes a number of these testing experts some of the most vocal (and persuasive) opponents of testing. But the flaws in techniques like value-added assessment do not automatically lead to the conclusion that those techniques shouldn’t be used to hold educators accountable. Testing may be imperfect, but the alternative—the old system, which allowed us to know very little about the performance of educators—is far, far worse.”

A Knowledge Base for the Teaching Profession: What Would It Look Like and How Can We Get One?
James Hiebert, Ronald Gallimore, James W. Stigler
Educational Researcher (Vol. 31, No. 5)
June/July 2002

“To improve classroom teaching in a steady, lasting way, the teaching profession needs a knowledge base that grows and improves. In spite of the continuing efforts of researchers, archived research knowledge has had little effect on the improvement of practice in the average classroom. We explore the possibility of building a useful knowledge base for teaching by beginning with practitioners' knowledge. We outline key features of this knowledge and identify the requirements for this knowledge to be transformed into a professional knowledge base for teaching. By reviewing educational history, we offer an incomplete explanation for why the United States has no countrywide system that meets these requirements. We conclude by wondering if U.S. researchers and teachers can make different choices in the future to enable a system for building and sustaining a professional knowledge base for teaching.”

District fiscal policy and student achievement: Evidence from combined NAEP-CCD data
Gary G. Huang, Binbing Yu
Education Policy Analysis Archives (Vol. 10, No. 38)
September 22, 2002

“School restructuring raises questions about the role of school districts in improving student learning. Centralization by state governments and decentralization to individual schools as proposed in systemic reform leave districts' role unsettled. Empirical research on the district role in the context of ongoing reform is inadequate. This analysis of combined data from the NAEP and the Common Core of Data (CCD) was intended to address the issue… The null effect was consistent in the analysis of the combined NAEP-CCD data for 1990, 1992, and 1996. In contrast, current expenditure per pupil was found related to higher math performance in a modest yet fairly consistent way. Future research may be productive to separately study individual states and integrate the findings onto the national level.”

Class Size Reduction, Teacher Quality, and Academic Achievement in California Public Elementary Schools
Christopher Jepsen, Steven Rivkin
Public Policy Institute of California
2002

“In 1996, California passed a statewide class size reduction (CSR) law that aimed to reduce average class sizes by roughly one-third in kindergarten through third grade. Educators and policymakers expected CSR to lead to large gains in student achievement. However, increasing the state’s teaching workforce by thousands of new teachers had the potential to offset the direct benefits of smaller classes, particularly for schools in economically disadvantaged communities that already had staffing difficulties. This report addresses the following questions: What were the effects of CSR on overall teacher experience, certification, and education? Were some schools affected more than others?; How did CSR affect student achievement? What were the benefits of smaller classes? What were the effects of new teachers?; Are the benefits of smaller classes concentrated among a subset of students, or did all schools benefit equally from CSR?”

Better Teachers, Better Schools
Marci Kanstoroom, Chester E. Finn, Jr., Editors
Thomas Fordham Foundation
July 1999

“This book suggests a different way of thinking about this perplexing but important issue, a way of thinking that's grounded in common sense rather than piety and that relies on evidence rather than supposition or wishful thinking. The common sense approach that we propose to boosting teacher quality involves easing back on regulations that control entry, devolving personnel decisions to individual schools, and then holding those schools accountable for producing results as gauged by their pupils' academic achievement. If this ‘tight-loose’ strategy sounds familiar, that's because it has become the dominant paradigm for reforming schools today. We think it warrants consideration for teachers, too.”

The Effectiveness of "Teach for America" and Other Under-certified Teachers on Student Academic Achievement: A Case of Harmful Public Policy
Ildiko Laczko-Kerr, David C. Berliner
Education Policy Analysis Archives (Vol. 10, No. 37)
September 6, 2002

“The academic achievements of students taught by under-certified primary school teachers were compared to the academic achievements of students taught by regularly certified primary school teachers.  This sample of under-certified teachers included three types of under-qualified personnel: emergency, temporary and provisionally certified teachers.  One subset of these under-certified teachers was from the national program Teach For America (TFA).  Recent college graduates are placed by TFA where other under-qualified under-certified teachers are often called upon to work, namely, low-income urban and rural school districts. Certified teachers in this study were from accredited universities and all met state requirements for receiving the regular initial certificate to teach.  Recently hired under-certified and certified teachers (N=293) from five low-income school districts were matched on a number of variables, resulting in 109 pairs of teachers whose students all took the mandated state achievement test. Results indicate 1) that students of TFA teachers did not perform significantly different from students of other under-certified teachers, and 2) that students of certified teachers out-performed students of teachers who were under-certified.  This was true on all three subtests of the SAT 9—reading, mathematics and language arts.  Effect sizes favoring the students of certified teachers were substantial.  In reading, mathematics, and language, the students of certified teachers outperformed students of under-certified teachers, including the students of the TFA teachers, by about 2 months on a grade equivalent scale.  Students of under-certified teachers make about 20% less academic growth per year than do students of teachers with regular certification.  Traditional programs of teacher preparation apparently result in positive effects on the academic achievement of low-income primary school children.  Present policies allowing under-certified teachers, including those from the TFA program, to work with our most difficult to teach children appear harmful.  Such policies increase differences in achievement between the performance of poor children, often immigrant and minority children, and those children who are more advantaged.”

Accountability Systems: Implications of Requirements of the No Child Left Behind Act of 2001
Robert L. Linn, Eva L. Baker, Damian W. Betebenner
Educational Researcher (Vol. 31, No. 6)
August/September 2002

“The No Child Left Behind Act of 2001 substantially increases the testing requirements for states and sets demanding accountability standards for schools, districts, and states with measurable adequate yearly progress (AYP) objectives for all students and subgroups of students defined by socioeconomic background, race/ethnicity, English language proficiency, and disability. However, states' content standards, the rigor of their tests, and the stringency of their performance standards vary greatly. Consequently, the percentage of students who score at the proficient level or higher on the state assessments varies radically from state to state. Some states have farther to go than others to meet the mandated target of 100% proficient within 12 years. These differences are illustrated and the implications for achieving AYP targets are discussed. Also addressed are possible uses of results from the biennial state-level administrations of the National Assessment of Educational Progress as a means of leveling the playing field. Factors contributing to the volatility of gains in achievement from year to year for individual schools are discussed.”

Consequences of Assessment: What is the Evidence?
William A. Mehrens.
Education Policy Analysis Archive (Vol. 6, No. 13)
July 14, 1998

“Attention is here directed toward the prevalence of large scale assessments (focusing primarily on state assessments). I examine the purposes of these assessment programs; enumerate both potential dangers and benefits of such assessments; investigate what the research evidence says about assessment consequences (including a discussion of the quality of the evidence); discuss how to evaluate whether the consequences are good or bad; present some ideas about what variables may influence the probabilities for good or bad consequences; and present some tentative conclusions about the whole issue of the consequences of assessment and the amount of evidence available and needed.”

The Class Size Debate
Lawrence Mishel, Richard Rothstein, editors 
Economic Policy Institute
2002

“In The Class Size Debate, two eminent economists debate the merits of smaller class sizes and the research methods used to measure the efficacy of this education reform measure. Alan Krueger (Princeton University) maintains that smaller class sizes can improve students’ performance and future earnings prospects. He challenges Eric Hanushek’s (Stanford University) widely cited analysis of the class size literature, arguing that it gives disproportionate weight to single studies that include a large number of estimates. An appropriate weighting, he says, would reveal that class size is indeed a determinant of student achievement. Prof. Hanushek counters that Prof. Krueger’s re-analysis achieves results different from his own by emphasizing low-quality estimates. He argues that other policies besides class size reduction, such as improving teacher quality, are more important. Jennifer King Rice (University of Maryland) brings a third-party perspective to the debate. She addresses each author’s arguments and focuses on the policy implications of the class size literature.”

Improving Teacher Evaluation to Improve Teaching Quality
National Governors Association
December 2002

"Governors understand the importance of guaranteeing that every child has an effective teacher. Research shows that teacher quality affects student achievement more greatly than any other school based variable. The No Child Left Behind Act requires a 'highly qualified' teacher in every classroom by the 2005-2006 school year and achievement gains by all students over time. These realities give policymakers a strong incentive to focus on preparing, recruiting, and retaining quality teachers as primary strategies to boost academic achievement."

Measuring the Content of Instruction: Uses in Research and Practice
Andrew C. Porter
Educational Researcher (Vol. 31, No. 7)
October 2002

"This article describes tools for measuring the content of instruction, the content of instructional materials, and the alignment between these. Illustrative findings about the use of these tools are reported, and possible additional uses, both for research and practice, are discussed. The validity of data produced through use of these tools is found to be quite good. An agenda for future work is sketched both for improvement of the quality and versatility of the tools and for use of the tools in research and practice."

Testing the Testers: An Annual Ranking of State Accountability Systems
Princeton Review
2003

"During the Winter of 2002-2003, The Princeton Review conducted Testing the Testers 2003, its second Annual Ranking of State Accountability Systems. Unlike other studies, ours is not primarily concerned with the rigor of academic standards or of the tests that measure them. Rather we focused on the policies that determine the overall character and effectiveness of each accountability system. Properly conceived and well-implemented, these policies will tend to produce systems that are consistent, secure, open to public scrutiny, and flexible enough to improve over time. We also believe they will tend to encourage and support an evolution to better and more effective schools. "

Teacher Quality Initiative
Public Education Foundation, Chattanooga, TN

Pages include a description of the project (which was discussed during the December 2002 TNE Institute in Washington, DC) and links to an October 2002 PowerPoint presentation of preliminary results, and results released in February 2003 of their teacher questionnaire.

What large-scale, survey research tells us about the effects of teachers and teaching on student achievement.
Brian Rowan
Consortium for Policy Research in Education
2002

“This paper is about conceptual and methodological issues that arise when educational researchers use data from large-scale, survey research studies to  investigate teacher effects on student achievement. In the paper, I illustrate the conceptual and methodological issues that arise in such research by reporting on a series of analyses I conducted using data from Prospects: The Congressionally Mandated Study of Educational Opportunity. This large-scale, survey research effort gathered a rich store of data on instructional processes and student achievement in a large sample of American elementary schools during the early 1990’s as part of the federal government’s evaluation of the Title I program. In this paper, I use data from this study to estimate the ‘overall’ size of teacher effects on student achievement and to test some specific hypotheses about why such effects occur. On the basis of these analyses, I draw some substantive conclusions about both the magnitude and sources of teacher effects on student achievement and suggest some ways that survey-based research on teaching can be improved.”

Educational Assessment Reassessed: The Usefulness of Standardized and Alternative Measures of Student Achievement as Indicators for the Assessment of Educational Outcomes
William L. Sanders, Sandra P. Horn
Education Policy Analysis Archive (Vol. 3 No. 7)
March 3, 1995

“For decades, the assessment of educational entities--school systems, individual schools, and teachers--has evoked strong and sometimes violent emotions from the educational community, the general public, and their legislative representatives. In spite of attempts to codify standards for the evaluation of these entities, assessment experts remain denominationalized--often religiously so. Methods of assessment based on the use of standardized tests have come under intense fire in recent years with some critics going so far as to call for their complete elimination. Those who advocate alternative methods of assessment have become increasingly outspoken in establishing exclusive rights to the legitimate assessment paradigm. However, some of the most respected advocates of alternative assessment have taken a more moderate view, warning against an ‘either-or’ mentality (Brandt, 1992, p. 35). Reflecting this more moderate perspective, this paper strongly advocates the use of multiple indicators of student learning, including those provided by standardized tests.”

Foundations for Success: Case Studies of How Urban School Systems Improve Student Achievement
Jason Snipes, Fred Doolittle, Corinne Herlihy
Council of Great City Schools
September 2002

"This report and the longer-term project of which it is a part focus on the potential role of the school district as an initiator and sustainer of academic improvement. While there has been much research on what makes an effective school, there is relatively little on what makes an effective district. In fact, many see large urban school districts as a source of problems rather than solutions. But for school improvement to be widespread and sustained, and for our nation to reduce racial differences in academic achievement, large urban districts must play a key role…. This report extends the existing research by examining the experiences of three large urban school districts (and a portion of a fourth) that have raised academic performance for their district as a whole, while also reducing racial differences in achievement."

Evidence Based Education Policies:
Transforming Educational Practice and Research
2002 Dewitt Wallace Reader's Digest Distinguished Lecture

Robert E. Slavin
Educational Researcher (Vol. 31, No. 7)
October 2002

"At the dawn of the 21st century, educational research is finally entering the 20th century. The use of randomized experiments that transformed medicine, agriculture, and technology in the 20th century is now beginning to affect educational policy. This article discusses the promise and pitfalls of randomized and rigorously matched experiments as a basis for policy and practice in education. It concludes that a focus on rigorous experiments evaluating replicable programs and practices is essential to build confidence in educational research among policymakers and educators. However, new funding is needed for such experiments and there is still a need for correlational, descriptive, and other disciplined inquiry in education. Our children deserve the best educational programs, based on the most rigorous evidence we can provide."

Expert Measures
Anita A. Summers
Education Next (Vol. 2, No. 2)
Summer 2002

“Critics of value-added assessment tend to embrace the concept but don’t want the results gleaned from such analysis to be used for accountability purposes—and especially don’t want to use the results to reward or sanction teachers…. The main concern with value-added assessment is that the technique exacerbates the amount of random error involved in measuring student performance. The risk is that teachers and schools may be wrongfully rewarded or punished because value-added techniques either over- or underestimated their students’ learning gains.”

How Schools Matter: The Link Between Teacher Classroom Practices and Student Academic Performance
Harold Wenglinsky
Education Policy Analysis Archives (Vol. 10, No. 12)
February 13, 2002
 
“Quantitative studies of school effects have generally supported the notion that the problems of U.S. education lie outside of the school. Yet such studies neglect the primary venue through which students learn, the classroom. The current study explores the link between classroom practices and student academic performance by applying multilevel modeling to the 1996 National Assessment of Educational Progress in mathematics. The study finds that the effects of classroom practices, when added to those of other teacher characteristics, are comparable in size to those of student background, suggesting that teachers can contribute as much to student learning as the students themselves.”





ABOUT US


WHAT'S NEW


SERVICES



PROGRAMS


PRODUCTS & PUBLICATIONS


RESOURCES


SITE MAP



CONTACT US




TNE Overview

Virtual Library Home

Design Principles:

Decisions Driven by Evidence

Engagement with the Arts & Sciences

Teaching as an Academically Taught Clinical Practice

Issues to be Addressed Jointly by Faculties in Education and the Arts & Sciences

General References