Average rating: | Rated 5 of 5. |
Level of importance: | Rated 5 of 5. |
Level of validity: | Rated 5 of 5. |
Level of completeness: | Rated 5 of 5. |
Level of comprehensibility: | Rated 5 of 5. |
Competing interests: | None |
Although the Khazan et al. (2020) study is so problematic that no professional reader will accept their conclusion of gender differences in student ratings, the surgical precision with which Uttl and Violo analyze this article is a joy. The Uttl and Violo paper should be required reading in all methodology courses.
In a replication of MacNell et al (2015), Khazan et al. (2020) had a teaching assistant (TA) evaluated who assisted in two sections of a course, once under her true gender and once under a false (i.e., opposite) gender. Thus, students who rated the performance of the TA were either under the impression that the TA was male or that the TA was female.
Uttl and Violo start out with the obvious and fundamental problem, namely that the mean difference in the ratings of the (apparently) male and female TA did not even approach statistical significance. This should already negate the conclusion that there was a gender bias in students’ ratings of the TA.
But Uttl and Violo point at other serious problems. One of the most serious, in my opinion, is that students were given one photo and one short biography of each of the TA’s. Obviously, such photos and biographies could bias student ratings even before the TA started his/her work. There is sufficient evidence in the research on SET’s that demonstrate the importance of first impressions of teachers. Thus, this material should not only have been carefully pretested, but also several photos and biographies should have been used for each gender. Furthermore, the study should also have used several TA’s or at least one male and one female TA, each appearing under their own and under the opposite gender.
Another problem of the study is the small sample size in each of the four conditions (i.e., TA gender x student raters’ gender). As Uttl and Violo point out, their statistical power to find a gender effect of the size reported in one of the major studies of gender differences in SET’s (Ottoboni & Stark, 2016) was only 0.18.
I do not want to go through all of their re-analyses, but merely repeat their damning conclusion, namely that the Khazan et al data reveal no evidence, whatsoever, that the female TA was rated differently from the male TA.
The fact that Dr. Khazan refused to provide Uttl and Violo with the raw data of their study suggests that they might have had little confidence that their own conclusions might stand up to closer scrutiny. Since most reputable journals these days require that data are made available, it also reflects poorly on the Journal of the North American Colleges and Teachers of Agriculture that they did not have this requirement.