EMAIL SIGN UP!
Most Popular This Week
- Corporate Win: Supreme Court Says Monsanto Has 'Control Over Product of Life'
- Patent Filing Claims Solar Energy ‘Breakthrough’
- Disaster Capitalism Strikes as Hedge Funds Circle Near-Bankrupt Municipalities Like Vultures
- In 'March Toward Disaster,' World Hits 400 PPM Milestone
- Ignoring Bee Crisis, EPA Greenlights New 'Highly Toxic' Pesticide
Popular content
Today's Top News
Neither Fair Nor Accurate: Research-Based Reasons Why High-Stakes Tests Should Not Be Used to Evaluate Teachers
Illustration: J.D. KingA pitched battle raged in my hometown of Seattle this fall. Superintendent Maria Goodloe-Johnson and the Seattle Public Schools district fought with the Seattle Education Association over their most recent teachers’ union contract. At the heart of the dispute: Should teacher evaluations be based in part on student scores on standardized tests?
Seattle is not unique in this struggle, and it is clear that Superintendent Goodloe-Johnson takes her cue from what is happening nationally.
In August, for instance, the Los Angeles Times printed a massive study in which LA student test scores were used to rate individual teacher effectiveness. The study was based on a statistical model referred to as value-added measurement (VAM). As part of the story, the Timespublished the names of roughly 6,000 teachers and their VAM ratings (see sidebar, p. 37).
In October the New York City Department of Education followed suit, publicizing plans to release the VAM scores for nearly 12,000 public school teachers. U.S. Secretary of Education Arne Duncan lauded both the Times study and the NYC Department of Education plans, a stance consistent with Race to the Top guidelines and President Obama’s support for using test scores to evaluate teachers and determine merit pay.
Current and former leaders of many major urban school districts, including Washington, D.C.’s Michelle Rhee and New Orleans’ Paul Vallas, have sought to use tests to evaluate teachers. In fact, the use of high-stakes standardized tests to evaluate teacher performance à la VAM has become one of the cornerstones of current efforts to reshape public education along the lines of the free market.
On the surface, the logic of VAM and using student scores to evaluate teachers seems like common sense: The more effective a teacher, the better his or her students should do on standardized tests.
However, although research tells us that teacher quality has an effect on test scores, this does not mean that a specific teacher is responsible for how a specific student performs on a standardized test. Nor does it mean we can equate effective teaching (or actual learning) with higher test scores.
Given the current attacks on teachers, teachers’ unions, and public education through the use of educational accountability schemes based wholly or partly on high-stakes standardized test scores and VAM, it is important that educators, students, and parents understand why, based on educational research, such tests should not be used to evaluate teachers.
Although there are many well-documented problems with using VAM to evaluate teachers, I’ve chosen to highlight six critical issues with VAM that are so problematic they alone should be enough to stop the use of high-stakes standardized tests for such evaluations. I hope these will be helpful as talking points for op-ed pieces, blogs, and discussions at school board meetings, PTA meetings, and in the bleachers at basketball games.
Statistical Error Rates
There is a statistical error rate of 35 percent when using one year’s worth of test data to measure a teacher’s effectiveness, and an error rate of 25 percent when using data from three years, researchers Peter Schochet and Hanley Chiang find in their 2010 report “Error Rates in Measuring Teacher and School Performance Based on Test Score Gains,” released by the U.S. Department of Education’s National Center for Education Statistics.
Bruce Baker, finance expert at Rutgers University, explains that using high-stakes test scores to evaluate teachers in this manner means there is a one-in-four chance that a teacher rated as “average” could be incorrectly rated as “below average” and face disciplinary measures. Because of these error rates, a teacher’s performance evaluation may pivot on what amounts to a statistical roll of the dice.
Year-to-Year Test Score Instability
As Tim Sass, economics professor at Florida State University, points out in “The Stability of Value-Added Measures of Teacher Quality and Implications for Teacher Compensation Policy,” test scores of students taught by the same teacher fluctuate wildly from year to year. In one study comparing two years of test scores across five urban districts, more than two-thirds of the bottom-ranked teachers one year had moved out of the bottom ranks the next year. Of this group, a full third went from the bottom 20 percent one year to the top 40 percent the next. Similarly, only one-third of the teachers who ranked highest one year kept their top ranking the next, and almost a third of the formerly top-ranked teachers landed in the bottom 40 percent in year two.
If test scores were an accurate measurement of teacher effectiveness, “effective” teachers would rate high consistently from year to year because they are good teachers; and one would expect “ineffective” teachers to rate low in terms of test scores just as consistently. Instead, the year-to-year instability that Sass highlights shows that test scores have very little to do with the effectiveness of a single teacher and have more to do with the change of students from year to year (unless, of course, one believes that one-third of the highest ranked teachers in the first year of the study simply decided to teach poorly in the second).
Day-to-Day Score Instability
Fifty to 80 percent of any improvement or decline in a student’s standardized test scores can be attributed to one-time, randomly occurring factors, according to Thomas Kane of Harvard University and Douglas Staiger of Dartmouth College in their research report “Volatility in Test Scores.”
This means that factors such as whether or not a child ate breakfast on test day, whether or not a child got into an argument with parents or peers on the way to school, which other students happened to be in attendance while taking the test, and the child’s feelings about the test administrator account for at least half of any given student’s standardized test score gains or losses. Some factors, such as a dog barking outside an open window, can affect an entire class.
Kane and Staiger’s findings illustrate that using tests to evaluate teachers ignores the reality that a host of individual daily factors that are completely out of a teacher’s control contribute to how a student performs on any given test. To reward or punish a teacher based on such scores could literally mean rewarding or punishing a teacher based on how well or poorly a student’s morning went.
Nonrandom Student Assignments
The grouping of students—either within schools through formal and informal tracking or across schools through race, socioeconomic class, and linguistic (ELL) segregation—greatly influences VAM test results, as 10 leading researchers in teacher quality and educational assessment highlight in their policy brief “Problems with the Use of Student Test Scores to Evaluate Teachers,” published by the Economic Policy Institute.
These researchers note that “teachers who have chosen to teach in schools serving more affluent students may appear to be more effective simply because they have students with more home and school supports for their prior and current learning, and not because they are better teachers.”
Even when VAM models attempt to take into account a student’s prior achievement or demographic characteristics, the models assume that all students will show test gains at an equal rate. This assumption, however, does not necessarily hold true for groups of students who historically have performed poorly on tests, for English language learners who are asked to become proficient in both a new language and a tested subject area, or for students with disabilities whose test-based rates of progress may be incomparable to any other student.
Nonrandom student assignment means that a teacher could be punished, dismissed, or lose tenure purely because the course they teach or the school they teach in has a significant population of traditionally low-scoring students who may show variable or slower test score gains.
Imprecise Measurement
High-stakes, standardized tests are also unable to account for the complexities of learning (and, by extension, teaching). For instance, we know from the linguistic research of Steven Pinker and others that learning often happens in a U-shape—that making mistakes is an integral part of the learning process. When children are tested, we never quite know where on the U-shaped learning curve they might be, nor do we realize that their mistakes could be a vital part of a natural learning process. When tests are used to evaluate teachers, it is possible that highly effective teachers who push students out of their cognitive comfort zones are penalized for provoking the deep learning that requires students to make mistakes on the way to greater understanding.
Standardized tests are also too crude to account for the possibility of cognitive transfer of skills that students learn across different subjects. Using VAM, as the researchers in the above-mentioned Economic Policy Institute policy brief explain, means that “the essay writing a student learns from his history teacher may be credited to his English teacher, even if the English teacher assigns no writing; the mathematics a student learns in her physics class may be credited to her math teacher.” In other words, we can never be certain which class and which teacher contributed to a given student’s test performance in any given subject.
Out-of-School Factors
Out-of-school factors such as inadequate access to health care, food insecurity, and poverty-related stress, among others, negatively impact the in-school achievement of students so profoundly that they severely limit what schools and teachers can do on their own, explains David Berliner, Regents Professor of Education at Arizona State University, in his report “Poverty and Potential.”
Although it is clear from the research of Stanford University’s Linda Darling-Hammond and others that teachers play an absolutely pivotal role in student success, when we use high-stakes tests to evaluate teachers, we incorrectly assume that teachers have the ability to overcome any obstacle in students’ lives to improve learning. Although good teachers are critically necessary, they are not always sufficient.
To assume otherwise is to think that teachers (and schools) can somehow make up for the lack of housing, food, safety, and living wage employment, among other factors, all on their own. The social safety net is the responsibility of a much broader socioeconomic network—not the sole responsibility of the teacher.
Politics, Not Reality
The reality of standardized tests is that they are too imprecise and inaccurate to measure the effectiveness of individual teachers. The sad thing is that testing experts, researchers, and psychometricians have known this for quite some time. In 1999, for instance, the expert panel that made up the Committee on Appropriate Test Use of the National Research Council cautioned that “an educational decision that will have a major impact on a test-taker should not be made solely or automatically on the basis of a single test score.”
Yet two short years later, a bipartisan Congress and the presidential administration of George W. Bush passed No Child Left Behind and its test-and-punish approach to school reform into law.
Although the Bush administration seemed to ignore educational research as a matter of policy (as illustrated through NCLB’s Reading First program and the advocacy of using phonics-only teaching methods that had little basis in research), many hoped for something different with the election of President Obama.
Unfortunately, the Obama administration has sent a clear message: When it comes to high-stakes standardized testing, the research doesn’t matter.
It hasn’t mattered that, according to the above cited U.S. Department of Education report, “More than 90 percent of the variation in student gain scores is due to the variation in student-levelfactors that are not under control of the teacher.”
It hasn’t mattered that the National Research Council of the National Academy of Sciences has stated that “VAM estimates of teacher effectiveness should not be used to make operational decisions because such estimates are far too unstable to be considered fair or reliable.”
It hasn’t mattered that even the researchers who completed the Los Angeles Timesstudy acknowledged that VAM data were too unreliable to use as the sole measure of teacher performance (a point that the Timesneglected to clearly articulate in their article).
Sadly, with Bush, now with Obama, politics and ideology trump educational research.
One would think that all of the policy makers, politicians, pundits, superintendents, talk show hosts, documentary movie makers, business leaders, and philanthropic foundations so in love with the idea of using test score data to evaluate teachers would be equally as passionate about accuracy. People’s lives are at stake, and yet the “data” underlying important decisions about teacher performance couldn’t be shakier.
The shakiness of test-based VAM data illustrates that the current fight over teacher “accountability” isn’t really about effectiveness. The more substantial public conversation we should be having about rising poverty, the racial resegregation of our schools, increasing unemployment, lack of health care, and the steady defunding of the public sector—all factors that have an overwhelming impact on students’ educational achievement—has been buried. Instead, teachers and their unions have become convenient scapegoats for our social, educational, and economic woes.
Yes, teachers’ performance needs to be evaluated, but in a manner that is fair and accurate. Using high-stakes standardized tests and VAM to make such evaluations is neither.
- Posted in
Comments
Note: Disqus 2012 is best viewed on an up to date browser. Click here for information. Instructions for how to sign up to comment can be viewed here. Our Comment Policy can be viewed here. Please follow the guidelines. Note to Readers: Spam Filter May Capture Legitimate Comments...


26 Comments so far
Show AllAs a former teacher, I agree with everything said in this article, but I think he left out the most important part of the evaluation problem. Tell us how teachers should be evaluated. I would like to know how it should be done.
It is not the purpose of the article to explain how teachers should be evaluated. The author is simply showing why such evaluations should not be based solely on test scores.
The relationship between teaching and learning is complex; no simple, turnkey mechanism can accurately evaluate teachers - or learners.
q
@readbetweenthe_lines
I totally agree with your analysis. I have an anecdote of my own on the matter of standardized testing.
I am a lawyer by trade. I took the SAT and the LSAT. Did significantly above average on both, but certainly did not blow either exam out of the water.
I took the bar exam in two states, and my scores on both exams were through the roof. The difference was because, as you alluded to, you cannot really study for the exams like the SAT and the LSAT.
The LSAT in particular measures one's ability to perform logical functions (i.e., determining what the conclusion of an argument is, determining which premise is necessary for an argument to be true). You can study for it, but only in a very limited sense. For the reasons you mentioned, you are either good at taking those kinds of exams or you aren't.
The bar exams, on the other hand, were based on substantive material that one could study, understand, and remember. The exam measures both one's ability to remember facts and understand legal concepts, as well as one's ability to recognize which concepts and facts apply in various situations.
It has been determined that there is a high correlation between performance on the LSAT and first year law school grades. However, this is not necessarily because students who do well on the LSAT are smarter than those who do not.
It turns out, first year law school exams are short (several hours at most), timed exams. So is the LSAT. Again, reiterating what you said, what both the LSAT and first year law school exams have in common are that they tend to measure reading speed, the ability to handle short, high-stress tests, and the ability to remember core principles that would be useful to maximize points on a short exam.
When you look at the relationship of the LSAT to second and third year law school grades, the correlation falls away entirely. The reason is that second and third year professors include more writing assignments, and generally will give significantly longer exams. Thus, more factors are taken into account in assigning grades for those years.
Not sure why there's such an issue with firing an incompetent teacher (besides having to deal with union). I never worked as teacher but in all companies i worked for, it was usually known who the slackers and incompetents were. I bet it is the same in schools. Why can't the principal or whoever is running the place call the person in his office and tell them straight they are not up to par an fire them?
They can and they do, and they threaten teachers with it far more often than they actually carry it out. The trouble lies with standards.
In a company, with few exceptions, the standard is production of profit, and the judge is the employer. That's pretty crisp, but primarily because it does not take into account the harm or benefit to most people who use the system, only to the owners.
The school systems answer to the state, the city or county, the taxpayers, the parents of students, and to students themselves to some degree. These groups disagree over what constitutes good teaching and good service. By and large, students don't want an old sourpuss, parents have great unwarranted faith that harsh discipline will produce sharp skills in reading and writing, businesses want productive employee-bots for chump change, and most everyone on the right wants heavily reinforced social propaganda. An administrator should have her or his own ideas as to what constitutes competence, but only a fool or someone with little experience of the business would expect the community to agree.
Schoolteachers in K-12 work under threat. This is only slightly true of those of us who teach adults, but it is extremely true of K-12 teachers. Those in K-12 have to use certain materials, have to use certain methods, and have to avoid dealing with certain topics that are extremely relevant to their students, mostly concerning sex, religion, and politics.
Of course, if you feel that not backing state-sponsored omissions in historical information is "slacking" (not that you do yourself, but many do), then firing a teacher who teaches differently may seem straightforward. If you think that students need to be disciplined heavily to learn (again, not that you do) then it seems natural to reward the harsh and penalize the more flexible instructor. If you think that classes should be interesting, then it is likely (though not guaranteed!) that neither of the former instructors is doing a good job.
Hence the argument over standards. But it seems to me that there is another problem with this argument as I understand you to have presented it.
You appear to have the notion that this is handled more or less efficiently and effectively by corporations and larger companies in general. Forgive me if I am lumping you in with people whose opinions you may not share, but on the face of it this sounds like the old Reaganite myth of corporate and so-called "free market" efficiency.
As nearly as I can tell, the only reason these ideas get repeated so often, and often by people who are not part of that movement, is that no one seems to have bothered to check. Like other despotic forms, corporations are abysmally inefficient and ineffective. We have some ineffectiveness where I teach, but thank goodness it isn't run like the private companies where I worked for thirty years!
By and large, my administrators work to arrange to have as many students taught as possible for the students' and the government's money. If we're talking about providing service, that is far more efficient than trying to get as much money as possible for services rendered, which of course is what a corporation generally does (not-for-profits excluded).
To my knowledge, with all our various competencies and incompetencies and differences of opinion, none of my colleagues are trying to avoid teaching or is not pleased to find that student learns in some way that can be verified. Would anyone argue that no one in a large office tries to avoid work or does not care that the company provides its services well?
Out here in California, we are also watching the process of a one-time state-run school warping under the stress of the corporate model. I do not teach at the University of California, but have always respected UC campuses as affording some of the world's finest education. As they progressively privatize, salaries for a few top administrators rise, salaries for rank-and-file faculty and employees drop, rates charged to students rise -- a drop in efficiency of service, certainly -- class sizes for undergraduates rises, more teaching is farmed to grad students, and -- well, it might be nice to know just what the benefit of this privatization is to anyone who does not happen to own a chunk of university.
No, let us not use corporations as a model. I would carp on, but I cede to a higher source: www.dilbert.com/
Machiavelli pointed out that the people of a nation get the government that they deserve. Unfortunately, as this article makes clear, they also get the educational system that they deserve.
"Sadly, with Bush, now with Obama, politics and ideology trump educational research."
Reading this article is as frustrating as it is informative.
q
And then fire the teachers who don't improve, revoking their tenure. They will sue, but in most cases the school district can cover most if not all the cost of the litigation in the spread between the incompetent's salary (which, by union rules of course grew from year to year without regard to ability or performance) and the salary of a new, vigorous, younger teacher who comes in as a newbie. The fired, litigious teacher will eventually lose the litigation and then really be screwed, which wiill scare the hell out of the next teachers who are "invited" to resign because they are incompetent or worse.
The funny thing about this is that all of it is already done. This is actually a fairly accurate description of the existing review process.
I want to add my two cents (and that is probably all it is worth). I have two years teaching experience. One year was a disaster for the students because I was not into it and frankly did not know how (but in spite of me, some of them learned). A few years later I was more mature and actually liked what I was doing. That year was very successful for all the students (all except one that I could not get involved). I have three daughters. They all went to the same grade school where they got mostly A's. My wife and I made sure they did their work. We were involved in their learning process. But, no credit to us, they were all three very bright and learning the subject matter came easily for them. We knew which teachers were very good and we also knew which were mediocre. But we made it a point to always support the teacher. Our oldest girl finished high school first in her class--and she worked a job in her last two years of school. We moved to another district at that point and our other two attended high school in the new district. They both (twins) came out of the gate in first place and never looked back. Once again they had a mix of good and mediocre teachers. But they learned anyway. If their teachers had been evaluated on the basis of my kids test scores (as first term sophomores they had scores of 31 and 33 on the ACT), the teachers would have been top-rated.
Because of this and my own educational/teaching experience, I believe that a student who has a desire to learn and the motivation and support from his/her family will do as well as he/she can in school regardless of teacher quality. If the administration is making sure that the teachers are covering the material and the teachers are maintaining an environment in which learning can take place, the students will pass muster and then some. Student test scores will define the teacher.
My bad. It should read "Student test scores shoutd NOT define the teacher."
thanks. I am really surprised that anyone read the meanderings of an old man..
I find the educational system suspect, and feel there is historical evidence to support...
I, therefore, do not understand the effort to succeed in such a system, either as a teacher or as a student...
the system is not what it appears to be, and does far more harm than good...
it should be disbanded, along with the rest of the modern world...
This is a comment on the title only: High stakes tests should not be used to evaluate students.
Pretty sure if I read the article, that would solve the problem.
As a veteran teacher of 20 years, I say this:
http://open.salon.com/blog/daniel_geery/2011/01/24/no_childs_behind_left_part_1_of_2
and this:
http://open.salon.com/blog/daniel_geery/2011/01/24/no_childs_behind_left_part_2_of_2
Standards, standardized testing, class tests and any other devices that purport to be able to break teaching and learning down into little discreet bits of information and assign a number/grade are all based on a basic logical fallacy. It is logically impossible to quantify a quality. They are two different logical categories and logically cannot be confused and conflated.
Teaching and learning are a quality of a dynamic social interaction that is constantly changing and therefore is not amenable to quantification. Any attempts to quantify teaching and learning really amounts to "mental masturbation" (a term that has gotten me into trouble before). Unfortunately, I have to play that game on a daily basis, however, I try to make sure the students know that it is just that-a "grade" game and that the focus/purpose of the class is to learn Spanish not just to get a grade.
I firmly believe that grades do far more harm than good. Would you like to be forced to go into a situation where you are constantly told you are average, below average or even an "F"-failing (the only letter grade that has an actual meaning attached to it) student. I wouldn't (I am fortunate to be one who by nature happens to be a natural test taker so my analysis does not stem from some left over pent up anger at schooling). No wonder many students want to, attempt to, leave that environment ("dropping out") by the time they are teenagers.
hey, oye!
I know you and I have gone 'round on this topic before, but I wanted to tell you that I admire your passion and clarity of personal vision, and I appreciate this post very much, with your focus on quality vs. quantity...
when one wishes to argue in a scientific world, one must be able to measure, even if the measurements are false...economics certainly comes to mind...
handy to create the region of focus, and the criteria for measuring...
as much as I see the schools as a mechanism for brainwashing, I suspect you, although I do not personally know you, of being a shining example of our best hope for a teacher struggling under such circumstances with the best ineerests of their charges at heart...your words speak well of you...
btw, 3 of my parents have been teachers...2 high school, one college...
Dubet,
Thanks for the kind response! I enjoy the chance to have rational, logical and civil interactions with the other writers on CD.
I try to do my best to "educate" my students not only in Spanish but in the many other circumstances that come up on a daily/hourly basis in a day of teaching. I have been called on the carpet many times for comments/brief class discussions of current events, school policies, etc. . . because they weren't "curriculum" related. I still take the chances as many times it's the only time that a student has a chance to discuss things with an "adult" in a non threatening setting.
You stated: "when one wishes to argue in a scientific world, one must be able to measure, even if the measurements are false...economics certainly comes to mind... " I have no trouble with measurements in I consider the "hard" sciences, physics, chemistry, biology, astronomy etc. . . . And I also do not have a problem with having that "scientific" concern with accuracy, methodology and sceptical thinking. But not everything can be measured and the "soft" sciences, anthropology, economics, psychometrics, much educational theory, etc. . . fall outside the realm of "true" science. The soft sciences "pretend" that they are "hard" sciences.
And if one starts with "error" in measurement any subsequent conclusions by defition are fraught with error. Yes, there is a possibility that one can still come up with a "correct" answer but those chances are quite slim as even the blind squirrel gets an acorn every now and again.
The problem stems from the differences in the various "language games" (to use Wittgenstein's term, later expounded upon by Lyotard-I use the names to identify those, many more times brilliant than myself, who originate the concepts in case one wants to read more about them) that we employ in different social settings. For example, in the courts just vs unjust is supposed to be the defining discourse, in business efficient vs inefficient in relation to the bottom line is the lingua franca, in the hard sciences as in mathematics something is or isn't true. The logical problems result when we attempt to use the language from one "game" to judge another "game".
In education, to me, the language game is "is it just or unjust and right or wrong by/for the student". For many years, and especially now with NCLB and RTTT, and the business "philanthropist's" funding of charter schools, etc.. . the attempt has been made to make the business language game of efficient vs inefficient, and the language of the hard sciences the language games of education. The powers that be and the vast majority of the educational establishment have all bought into the concept that education is a science (with all it's accompanying pseudo scientific language). It can't work as these "games" are incompatible with the education "game".
Another way to look at it is that teaching/learning is an art and not a science. To attempt to turn teaching/learning into a "science" is to, again, confuse language games which result in real harm to the students-"failing" a course, not being able to obtain scholarships based on standardized tests, not being admitted to a certain profession etc. . . .
Again, thanks for the kind words!
OYE
P.S. Does anyone else have a problem with the cursor making the page jump around in trying to type in this very small window????
Who tests the testers? Who tests the testers of the testers? Ad infinitum. I like the observation by Woody Allen: "God is either cruel or incompetent".
Psychometrics is a science. Psychometrics is a power game. Like Certs, Psychometrics is TWO, TWO, two things in one.
I spent may years of my life on this battle ground, administering standard tests, developing my own normative data, sometimes creating my own test instruments. It is an ethical minefield.
Trylon
Better said, "psycometrics is a pseudo-science". See Noel Wilson's dissertation "Educational Standards and the Problem of Error" to be found at
http://epaa.asu.edu/ojs/article/view/577
=It is logically impossible to quantify a quality. They are two different logical categories and logically cannot be confused and conflated.=
Sorry but this is incorrect. Metrics are applied to quality day in and day out. It is the basic data of Quality Assurance. It is the basic data of opinion polls and consumer product testing and usability testing. It is the data of medical information gathering, e.g. pain, sleep, recovery rating. The numerical results are often tested by Chi Square.
Psychometrics is the science of scaling and measuring a CONSTRUCT. While beauty is in the eye of the beholder, we say things like "Tits 7, ass 6" on a subjective unidirectional scale. Or we can use Lickert Scales, going from Minus X to Plus X. Indicate your agreement with the statement: "Her tits and ass are gorgeous." Minus 5 = Strongly Disagree, minus 3 =, Mildly Disagree, 0 = I feel neutral about them, plus 3 = Mildly Agree, plus 5 = Strongly Agree. And - - judged beauty is a mainstay in the pedagogical problem of Halo Effect.
But _ I'll go have a look at the article. Thanks.
Trylon
One other observation: Fifty years ago there was no public concern over whether teachers were doing a good job or not. The fact was that all teachers were grossly underpaid. Those who continued to be teachers were a committed bunch of people. Most teachers in elementary levels were female. The rule of thumb back then said that women did the jobs that no man would do and they got paid far less for the job. But gradually, because of unionization, teachers began to earn an amount close to 75% of their fellow college graduates. They fought for parity but it never came. During the same time Educational Administrators saw their wages go to double that of the highest paid teachers. The majority of these administrators were male. Suddenly the public was told that their kids were not doing well because of the following:
a. teachers were overpaid ("after all they only work 9 months a year"--never mind that state law requires them to attend school in summer for continuing ed.)
b. teachers do as they please because the union won't allow any discipline.
c. teachers are too permissive. (never mind that armed police will not even venture into a classroom setting unless they have backup.)
d. teachers are like air controllers. Break their union and then you will get professional behavior from them.
e. teacher salaries are the reason for high taxes. (the administrators spend billions of dollars building country club campuses that are finer than any university I attended (7) at the time I attended).
The only thing that will satisfy the critics is to put a cap on teacher salaries at about $35,000 and allow termination of any teacher without any kind of due process. That way they can have $50 million dollar school building staffed with a principal making $85,000, three assistants making $75,000 each, a athletic director and coaches making $70,000 each (all previous positions male) and teachers earning an average of $32,000 each (all female). Then the critics will have their dream realized. And guess what they will still blame high costs on the greedy teachers. And watch student test scores go down when they can't find teachers to put in the luxury classrooms.
"although research tells us that teacher quality has an effect on test scores, this does not mean that a specific teacher is responsible for how a specific student performs on a standardized test."
Notwithstanding all the rest of the post, this particular point is obvious nonsense. Perhaps it might better have been put "this does not mean that from a specific student's performance on a particular test, we can know how much of that student's performance a particular teacher is responsible for."
This might be true. And the answer is to grade the teacher over several student's tests. The reply to that is that the students that a teacher gets is not a random sample, and so on it goes.