Discrimination in track recommendation but not in grading: experimental evidence among primary school teachers in Hungary
Abstract
Abstract This study examines discrimination in teacher assessments and track recommendations against Roma minority students in Hungary. We conducted a pre-registered randomized experiment among 413 primary school teachers. Participating teachers evaluated six mathematics or literacy and grammar tests with fictitious, randomized student names and recommended a high school track. Our results show mixed evidence for discrimination against Roma students: teachers do not discriminate in test evaluations but do so in high school track recommendations, though this latter effect is small. We find that contextual factors play a substantial role in discrimination in track recommendations: teachers who receive tests with fewer Roma than non-Roma names discriminate against Roma students, whereas teachers who receive tests with more Roma names do not. In the latter case, non-Roma students receive similarly low track recommendations as Roma students in both experimental conditions. The results are consistent with stereotype-based theories of discrimination.
GROBID Extracted text; discontinued.
This text is generated from TEI extraction for accessibility, search, and TTS. Formulas, tables, figures, page layout, and references may not perfectly match the original PDF.
Extracted abstract
This study examines discrimination in teacher assessments and track recommendations against Roma minority students in Hungary. We conducted a pre-registered randomized experiment among 413 primary school teachers. Participating teachers evaluated six mathematics or literacy and grammar tests with fictitious, randomized student names and recommended a high school track. Our results show mixed evidence for discrimination against Roma students: teachers do not discriminate in test evaluations but do so in high school track recommendations, though this latter effect is small. We find that contextual factors play a substantial role in discrimination in track recommendations: teachers who receive tests with fewer Roma than non-Roma names discriminate against Roma students, whereas teachers who receive tests with more Roma names do not. In the latter case, non-Roma students receive similarly low track recommendations as Roma students in both experimental conditions. The results are consistent with stereotype-based theories of discrimination.
Introduction
Ethnicity is one of the main student characteristics along which substantial educational inequalities occur. In most Western countries, children of some immigrant and ethnic-racial minority groups considerably lag behind majority students in terms of academic achievement and educational attainment (Heath and Brinbaum, 2007; Borgna and Contini, 2014) .
Ethnic biases in teacher assessments and track recommendations might contribute to the ethnic educational penalties. In many educational systems, teacher-given grades are taken into account in within-school track placement decisions or at transition points to the next educational level. Similarly, teachers' track recommendations influence students' track placements either because they are binding or because informal recommendations influence families' track choices (Caro et al., 2009; Boone and Van Houtte, 2013) . Several observational studies have shown ethnic biases in teacher-given grades (Lindahl, 2007; Ouazad, 2008; Burgess and Greaves, 2013; Kiss, 2013; Botelho, Madeira and Rangel, 2015; Triventi, 2020; Kisfalusi, Janky and Takács, 2021 ; for a review, see Zanga and De Gioannis, 2023) and track recommendations (Caro et al., 2009; Boone and Van Houtte, 2013; Geven, Batruch and van de Werfhorst, 2018; Timmermans et al., 2018) , conditional on students' school performance.
Observational studies, however, often suffer from the problem of omitted variable bias: unobserved differences in students' motivation, aspirations, or effort might explain the ethnic differences in teacher assessments and track recommendations (Small and Pager, 2020) . Experimental studies overcome this limitation by eliminating potential unobserved ethnic differences and holding student performance across ethnic groups constant. These studies identify discrimination by randomly assigning minority and majority names to tests teachers need to evaluate. Experimental evidence was found for grading discrimination against low-caste students in India (Hanna and Linden, 2012) and against students of Turkish origin in Germany (Sprietsma, 2013) but not in the Netherlands (van Ewijk, 2011) . Furthermore, experimental evidence was found for ethnic discrimination in secondary school track recommendations in Germany and the Netherlands (van Ewijk, 2011; Sprietsma, 2013; Wenz and Hoenig, 2020) . In Hungary, experimental studies were only conducted among preservice teachers but not among in-service teachers: while no discrimination was found in grading (Takács, 2018) , there was discrimination against Roma students in track recommendations (Bruneau et al., 2020) .
This paper analyses discrimination in teacher assessments and track recommendations against Roma students in Hungary, using a pre-registered randomized experiment. In the experiment, in-service teachers were asked to evaluate mathematics and literacy tests of fictitious students and then recommend a high school track (N=413 teachers, 2,478 teacher-test observations). Students' names on the tests were randomized (Roma male, Roma female, non-Roma male, and non-Roma female). In addition to testing the pre-registered hypotheses on whether teacher discrimination exists, we explore contextual effects on discrimination. Specifically, similar to the studies by van Ewijk (2011) and Sprietsma (2013) , we examine whether the ethnic composition of the names teachers encounter during the experiment moderates discrimination.
Our contribution to the literature is 2-fold. First, we investigate discrimination in teacher assessments and track recommendations against one of Europe's largest and most disadvantaged non-immigrant minority groups, whereas most previous experimental studies have focussed on discrimination against immigrant groups. In many developed countries, immigrants are a positively selected group in education with respect to their sending countries (Engzell, 2019) . Therefore, conditional on socioeconomic background and school performance, children of immigrants often have higher educational aspirations and make more ambitious educational choices than their native peers (Jonsson and Rudolphi, 2011; Salikutluk, 2016; Engzell, 2019) . There is no similar evidence in the case of Roma students; a recent study found that Roma students have less ambitious secondary school choices compared with non-Roma students with identical abilities, which is only partly explained by their lower socioeconomic status (Kisfalusi, 2023) . Thus, teacher attitudes, stereotypes, and discrimination might significantly differ in the case of children of immigrants and the Roma.
Second, we highlight the role of contextual effects in discrimination. By integrating stereotype-based theories of discrimination with findings from research on school segregation, we argue that the perceived characteristics of the school environment might moderate the extent of discrimination against minority students (McKown and Weinstein, 2008; Glock, Kovacs and Pit-ten Cate, 2019) . By asking teachers to evaluate multiple tests and varying the ethnic composition of the names, we show that the composition of the names moderates the effect of the individual names on track recommendations. This has important methodological implications: discrimination experiments are sensitive to the experimental design. The composition of the fictitious names used in the experiment might influence participants' discriminatory tendencies.
The remainder of the paper is organized as follows: in the next sections, we review the relevant theories of discrimination, explain why the ethnic context might play a role in discrimination, and introduce the Hungarian educational context. Then, we present the experimental design, the pre-registered analytical plan, and the steps of the exploratory analysis. In the Results section, we first present the confirmatory analysis of the pre-registered hypotheses; then, we conduct an exploratory analysis of the role of the ethnic context. The final section contains a discussion of the findings and limitations.
Theories of discrimination
In line with the definition of discrimination provided by Blank et al. (2004) and Pager and Shepherd (2008) , we define discrimination in teacher assessments as the differential treatment of students on the basis of a social category (Bygren, 2020) . Economic, sociological, and psychological theories identify two main sources of discrimination: group-related preferences or prejudices, and group-related prior beliefs or stereotypes.
Taste-based theories of discrimination assume that individuals treat social groups differentially based on their preferences for their in-group (Turner, 1975; Tajfel and Turner, 1979; Tajfel, 1982) or prejudices towards the out-group (Blumer, 1958; Blalock, 1967; Quillian, 1995) . The most prominent economic model of taste-based discrimination was provided by Becker (1957) , who suggested that individuals with inherent preferences against an out-group would be willing to pay a cost to avoid interactions with members of this group.
The other main line of discrimination theories emphasizes the role of imperfect individual information in discrimination and assumes that people, consciously or unconsciously, rely on their group-related prior beliefs, experiences, and stereotypes in social interactions (Quillian, 2006; Lorenz, 2021) . These stereotypes may be accurate or inaccurate (Jussim et al., 2009) . Social psychological theories suggest that individuals automatically categorize others based on group-specific stereotypes first. Then, as more information becomes available and they are motivated to update their category-based opinion, they integrate individuating DISCRIMINATION IN TEACHER ASSESSMENTS AND TRACK RECOMMENDATIONS information into their person perception (Fiske and Neuberg, 1990; Fiske, 1998) . Economic theories of statistical discrimination assume, too, that individuals rely on their prior beliefs about the performance of social groups until information about individual characteristics becomes available (Phelps, 1972; Arrow, 1973; Aigner and Cain, 1977) . While early work in this literature assumes that decision makers use the correctly perceived average characteristics of the target group, and thus, they rely on information that is correct on average (Phelps, 1972; Arrow, 1973) , more recent work on statistical discrimination recognizes that information or prior beliefs about group characteristics might be inaccurate (England and Lewin, 1989; Bohren et al., 2023) . We will refer to these social psychological and economic theories collectively as stereotype-based theories of discrimination. In this context, the term 'stereotype' denotes a belief held about a social group that may or may not be accurate on average.
In randomized experiments such as ours, teachers only know the names of the (fictitious) students whose tests they need to evaluate but have no additional knowledge about the students. The quality of the test, however, provides information on students' abilities. In this setting, taste-based theories of discrimination suggest that teachers, who generally belong to the majority ethnic group in the Hungarian context, favour majority students and disfavour minority students in both test evaluations and track recommendations. Although stereotypes might also influence test evaluations if, for instance, teachers assume that minority students cheated during the test, stereotype-based discrimination is more likely to occur in track recommendations. When recommending a secondary school track, teachers usually take into account other factors besides students' abilities (Boone and Van Houtte, 2013) . Information about these factors is lacking in the experimental situation. Hence, teachers might substitute the lacking individual information with prior beliefs about group characteristics.
Our pre-registered hypotheses concentrate on whether there is discrimination against Roma students in test evaluations and track recommendations. We hypothesize that Roma students receive lower test scores on the same test than non-Roma students (Hypothesis 1) and that Roma students receive lower track recommendations than non-Roma students based on the same test results (Hypothesis 2). In the discussion, we also address the potential underlying mechanisms that might explain our findings.
The role of the ethnic context in discrimination
In an exploratory analysis, we examine whether the ethnic composition of the names teachers encounter in the experiment moderates discrimination against Roma students. We argue that in discrimination experiments where participants are presented with multiple fictitious names, the composition of the names (i.e., the share of minority names) might carry important information for teachers over and above the individual names (i.e., whether the name is a minority name or not). While individual names indicate students' ethnic group belongings, the ethnic composition of the names might provide information on and activate stereotypes about students' school environment (McKown and Weinstein, 2008; Glock, Kovacs and Pit-ten Cate, 2019; Keller, 2024) . Specifically, a higher number of ethnic-sounding names might trigger perceptions of segregated schools. 1 Research on school segregation from both sides of the Atlantic shows that schools with a high proportion of minority students usually also enrol lower-status majority students than schools with a low proportion of minority students (Brunello and Rocco, 2013; Billings, Deming and Rockoff, 2014) . This phenomenon is also observed in Hungary. Since standardized achievement tests demonstrate a considerable disparity in performance by ethnicity and social status, this also means that segregated schools exhibit a lower average performance than schools attended by high-status majority students (Hermann and Kisfalusi, 2023) .
As we highlighted earlier, stereotype-based theories of discrimination assume that teachers substitute the lacking individual information with prior beliefs about group characteristics. If a higher number of minority names indeed trigger perceptions of segregated schools, then ethnicity is not the only information teachers might rely on in the experiment. Besides their prior beliefs about minority and majority students' academic achievement, teachers might also rely on their prior knowledge about the achievement level of students attending segregated schools and combine these two pieces of information in their decision-making. How these different stereotypes are combined determines whether we can expect a moderation effect to occur (Nicolas, de la Fuente and Fiske, 2017) .
One possibility is that negative stereotypes about minority students and negative stereotypes about students attending segregated schools simply add up. This implies that tests with Roma names as well as tests with non-Roma names receive lower evaluations and track recommendations when the share of minority students is higher compared with the case when this share is lower. However, the ethnic gap is the same in the two conditions, with minority students receiving lower evaluations.
Another possibility is suggested by studies which have demonstrated that students with similar academic abilities are graded less favourably in a class with a higher average achievement level than in a class with a lower average achievement level (Marsh, 1987; Südkamp and Möller, 2009) . If this contrast effect occurs, and teachers associate the lower (higher) share of minority names with a higher (lower) average achievement level, then both tests with Roma and non-Roma names are expected to receive higher evaluations and track recommendations when the share of minority students is higher, and therefore, the assumed achievement level is lower. But, again, the ethnic gap should be the same in the two conditions, with minority students receiving lower evaluations.
However, if the stereotypes associated with one ethnic group are more pervasive than those associated with the other group, then the ethnic composition of the names may moderate the effect of the individual names. This is because the additional contextual information may be less influential in the evaluation of the ethnic group that is subject to more robust stereotypes. For instance, if negative stereotypes about minority students are more pervasive than positive stereotypes about majority students, then contextual effects might not influence the evaluation of minority students. Consequently, they will receive lower evaluations regardless of their school context. Conversely, majority names might receive lower (in the case of additive stereotypes) or higher (in the case of a contrast effect) evaluations or recommendations when they appear together with many minority names because, due to less robust stereotypes, the contextual information affects their evaluations.
Institutional background
In Hungary, primary schools provide education in grades 1-8 (ISCED 1 and ISCED 2 levels, from age 6 to 14), whereas upper secondary education encompasses grades 9-12/13. Compulsory education lasts until the age of 16. Thus, after grade 8, students are obliged to choose from three different secondary school tracks. On the one extreme, grammar school is the academic and college-bound secondary track (gimnázium) that provides the high school final examination (érettségi). On the other extreme, vocational schools (szakközép iskola) prepare students for manual professions and trades without providing direct access to tertiary education. Finally, there is a mixed track (technikum) that provides the high school final examination and also gives a vocational diploma. Forty-five per cent of Hungarian secondary school students are enrolled in the academic track, 38 per cent are enrolled in the mixed track, and the remaining 17 per cent attend vocational schools (KSH, 2022) .
Admission to secondary education is merit-based and depends on students' achievement. On the one hand, students' end-of-semester grades in the core subjects, including Hungarian grammar and literature and mathematics, are taken into account. On the other hand, students participate in a centrally organized admission test in mathematics and Hungarian. The most competitive secondary schools also require an oral exam that the individual secondary schools organize for the applicants.
Although Hungary belongs to those countries where primary school teachers' track recommendations are not binding, teachers' informal recommendations influence students' school choices even in these educational systems (Caro et al., 2009; Boone and Van Houtte, 2013) . This is because advice given by primary school teachers is an important clue for parents about which track is most suitable for their children (Boone and Van Houtte, 2013) . Empirical studies from Hungary show that teachers communicate their track recommendations to the students, and families take into account these recommendations (Suhajda, 2017) . Track recommendations might be especially important for minority students and students from lower socioeconomic backgrounds who are less informed about the requirements of the different tracks than students with highly educated parents (Borgna et al., 2022; Keller, Takács and Elwert, 2022) .
Social context
The Roma minority is one of the largest ethnic minorities in Europe (O'Nions, 2016) . In Hungary, their share is estimated to be around 5-6 per cent of the total population and around 12-14 per cent of schoolage children (Kemény and Janky, 2006) . Roma people experience substantial economic and social exclusion (Kertesi and Kézdi, 2011) as well as residential and school segregation (Kemény and Janky, 2006; Kertesi and Kézdi, 2012) . As a result, Roma students have far lower average test scores than non-Roma students and are more likely to attend segregated schools providing low-quality education (Havas and Liskó, 2005; Kertesi and Kézdi, 2011; Hajdu, Kertesi and Kézdi, 2019) . Furthermore, Roma students are more likely to drop out of secondary school and less likely to attend the academic or the mixed track than non-Roma students. Therefore, they are less likely to obtain the final exam and continue their studies at the tertiary level (Kertesi and Kézdi, 2009; Hajdu, Kertesi and Kézdi, 2014) .
Experimental design
Study overview
We conducted a large-scale online experiment between 18 June and 6 September 2021. Our target population consisted of grammar and literature and mathematics teachers in the fifth to eighth grades (ISCED 2 level).
DISCRIMINATION IN TEACHER ASSESSMENTS AND TRACK RECOMMENDATIONS
We contacted every Hungarian primary school with an invitation letter sent to the schools' publicly available e-mail addresses. The invitation letter was addressed to the school principal. Principals were asked to forward the e-mail to the grammar and mathematics teachers in the school. Principals and teachers were told that the study focuses on differences in teachers' grading practices. The number of teachers allowed to participate in the experiment was limited to five mathematics and five grammar teachers per school. Teachers received 10,000 HUF (equivalent to 34.4 USD at the exchange rate from June 2021) for participation.
Overall, 193 grammar and 220 mathematics teachers participated in the online experiment. 2 Respondents were on average 49.9 years old (SD = 8.9) and had an average teaching experience of 21.9 years (SD = 10.5). The vast majority of the teachers were female in both subjects (grammar: 92.8 per cent; mathematics: 86.4 per cent). The age and gender distribution of participants are similar to those of the primary school teacher population in Hungary (more than 85 per cent of primary school teachers are female, and almost 50 per cent are above the age of 50, see Hajdu et al., 2022: pp. 76-79) . Supplementary Table S1 shows that the distribution of primary schools participating in the experiment represents the Hungarian primary schools well according to location, provider, and the share of socioeconomically disadvantaged students. Supplementary Table S2 shows the number of respondents per school.
Teachers' tasks in the experiment
Mathematics teachers were asked to correct six different solutions for the same mathematics test, whereas grammar teachers were asked to correct six different solutions for the same literacy test. The different solutions varied in quality. The fictitious student names on the tests were randomized. Around half of the teachers received the six tests with two Roma (one male and one female) and four non-Roma names (two male and two female), and the other half of the teachers received four Roma (two male and two female) and two non-Roma names (one male and one female). A detailed description of the experimental procedure and the selection of names can be found in the Supplementary Material Appendix A.
Teachers had four tasks in the experiment:
1. They first corrected and evaluated the tests on a 30-point scale.
Teachers were not provided with a solution because we wanted to avoid influencing teachers' grading practices. 2. Teachers assigned a grade to the test on the 5-grade scale used in the Hungarian educational system (1 = fail, 2 = pass, 3 = satisfactory, 4 = good, 5 = excellent). Schools (and often teachers within schools) apply different rules to translate test scores to grades. Therefore, it is possible that discrimination does not occur in assigning test scores, but in translating the test scores into grades. 3. Teachers recommended a secondary school track to the students whose tests they evaluated (1vocational track, 2-mixed track, 3-academic track). Before the track recommendations, we communicated to teachers fictitious, randomized test results that students received in the other school subject (for details, see the pre-analysis plan). We added this random noise since high school admission depends on both mathematics and literacy and grammar tests. Therefore, it is more natural to give a recommendation based on both test results. 4. After the experiment, teachers were asked to fill out a questionnaire focussing on background information. They also had the possibility to provide qualitative feedback.
Research ethics
The study was reviewed and approved by the IRB office at the HUN-REN Centre for Social Sciences. The IRB's decision was issued on 17 June 2021. Our data collection process corresponded to the recent General Data Protection Regulation of the European Union. All participants gave their informed consent prior to their participation in the research, and adequate steps were taken to protect participants' confidentiality.
Statistical analyses
Research transparency
Our statistical analysis follows the detailed pre-registration we submitted to the RCT Registry of the American Economic Association before beginning the fieldwork: https://doi.org/10.1257/rct.7838-2.0 . Any deviations from the original pre-analysis plan are indicated in the paper. We archived the data, analytic scripts, and an anonymized version of the pre-analysis plan on the project's page on the Open Science Framework: https:// osf.io/743df/ Data and questionnaires have also been reposited at the Research Documentation Centre of the HUN-REN Centre for Social Sciences: https://doi.org/10.17203/ KDK558 .
Main variables and coding
We pre-registered two primary outcome variables. First, the test score captures the total points teachers assigned to the test, which ranges from 0 to 30 in the case of both subjects. Figure 1 presents the empirical distribution of test scores in the experiment. The figure shows substantial variation in test scores in both subjects.
Second, the recommended secondary school track was recoded into a dummy variable, which equals 1 if the teacher recommended a secondary track that ends with the high school final exam (academic or mixed track), 0 otherwise. 3 Hypotheses are tested for these two outcome variables.
Our pre-registered secondary outcome variable is the grade assigned to the test (integer between 1 and 5).
The treatment variable Roma equals 1 if the name of the fictitious test-writer student is a Roma name; otherwise, Roma = 0.
We explore the effect of the ethnic context on discrimination with the variable fewer Roma names (fR), which equals 1 if teachers received a test package with two Roma and four non-Roma names and 0 if teachers received a test package with four Roma and two non-Roma names.
Empirical model
In our primary, confirmatory analysis, we pre-registered the following linear regression model with teacher and test fixed effects to test our hypotheses:
Y t,e,j = β 0 + β 1 × Roma t,e,j + β 2 × male t,e,j + ϕ e + θ t + t,e,j (1)
where Y t,e is the outcome variable (H1: test score, H2: track recommendation). The index t indicates teacher, the index e indicates test, and the index j indicates the name written on the test. The variable Roma indicates (=1) if the name of the fictitious test-writer student was a common Roma name; otherwise, Roma = 0. The variable male indicates (=1) if the fictitious test-writer student was male; otherwise, male = 0. ϕ e represents test fixed effects and θ t represents teacher fixed effects. β 1 is the coefficient of interest and represents the difference in teachers' evaluation regarding the Roma and non-Roma students' tests/track recommendations.
The coefficient has a causal interpretation because test packages were randomly assigned to teachers. Note that in the case of track recommendations, the estimated model is a linear probability model, as the outcome is binary. We calculate standard errors clustered at the teacher level. 4 In our primary analysis, we use one-sided t-tests to test our hypotheses.
In order to gain power and since teacher and test fixed effects are not needed for the identification as test packages were randomly assigned to teachers, we repeated the analyses without teacher and test fixed effects as a (not pre-registered) robustness check.
Our secondary, exploratory analysis consists of various tests. 5 First, we substitute Y t,e with our secondary outcome, which is the grade assigned to the test. Second, to reveal potential heterogeneities in the treatment effect, we investigate the interaction between gender and ethnicity, as well as between test quality and ethnicity, by including the appropriate interaction terms in the model. Third, we restrict our main analysis to teachers who might teach Roma students in their school and, thus, might have the opportunity to discriminate against Roma students in their everyday teaching practices. As Wenz and Hoenig (2020) argue based on Heckman (1998) , concentrating on teachers who have actual contact with Roma students provides a more accurate estimation of discrimination that happens in the everyday school context. For this analysis, we restrict the sample to teachers who are employed in schools with at least a 1 per cent share of Roma students (representing 83 per cent of the total sample). 6 Finally, we explore how the perceived ethnic context plays a role in discrimination. We estimate EQ2 to investigate whether the number of Roma names teachers encountered in the experiment moderates the effect of ethnicity on the outcomes: Y t,e,j = β 0 + β 1 × Roma t,e,j + β 2 × male t,e,j
+ β 3 × f R t + β 4 × f R t × Roma t,e,j
+ ϕ e + t,e,j
(2)
where the coefficient β 3 is a dummy variable indicating the share of Roma names in the six tests (1 = fewer Roma than non-Roma names, 0 = more Roma than non-Roma names). As teachers received a random set of tests, β 3 has a causal interpretation. The coefficient β 4 captures the difference in the treatment effects in test packages with fewer and more Roma names. 7 Test fixed effects are always included in these specifications because our randomization procedure focussed on balancing the ethnicity of the individual names but not the ethnic composition of the names across the different tests. Since our sample was not large enough to completely balance the share of Roma names across the different tests (see Supplementary Table S3 ), and thus, across different test qualities, test fixed effects are needed to control for test quality in this exploratory analysis.
Results
Main results
Raw differences in the dependent variables by the ethnicity and gender of the fictitious names are presented in Table 1 . Tests with non-Roma names received higher track recommendations on average than tests with Roma names (0.70 vs. 0.66, P = 0.03), whereas tests with male names received higher test scores on average than tests with female names (19.05 vs. 18.58, P = 0.03).
Table 2 shows the results of the main regression analysis. Models 1 and 2 present the results of the pre-registered confirmatory analysis, testing H1 and H2, respectively. Models 4 and 5 show the same analysis without teacher and test fixed effects. Regardless of the specification, teachers did not evaluate tests with Roma names less favourably than tests with non-Roma names (β 1 = -0.011, P = 0.465 in Model 1). Therefore, Hypothesis 1 is not supported. 8 With regard to Hypothesis 2, in the pre-registered model, teachers were 2.6 percentage points less likely to recommend a secondary school track ending with the high school final exam for Roma students than for non-Roma students (β 1 = -0.026, P = 0.042, Model 2). Without teacher and test fixed effects, the ethnic difference is 4 percentage points (β 1 = -0.040, P = 0.019, Model 5). The difference between the parameter estimates obtained in Model 2 and Model 5 is statistically not significant. The estimated effect sizes are -0.057 SD and -0.086 SD of the outcome variable in Model 2 and Model 5, respectively. After applying the Benjamini-Hochberg (1995) procedure for multiple hypothesis testing, the coefficient is not significant in the pre-registered model at the 0.05 level. The lack of statistical significance in the pre-registered model is likely the consequence of the fact that we found an effect size that is half as large as the expected effect size we used in the power calculations to design our sample size. Therefore, we interpret this finding as evidence for Hypothesis 2, though the effect size is small. Models 3 and 6 show that results for the secondary outcome variable-grade assigned to the test-are similar to that of test scores. Teachers did not assign lower grades to Roma than to non-Roma students. Notes: Test score ranges from 0 to 30. Grade ranges from 1 to 5. Recommendation equals 1 if the teacher recommended a secondary track that ends with a high school final exam, 0 otherwise. The difference is significant between Roma and non-Roma students in the case of track recommendations, and between female and male students in the case of test scores (at the P < 0.05 level).
Interaction effects with gender and test quality are presented in the Supplementary Material Appendix B. No significant interaction was found between gender and ethnicity (see Supplementary Table S5 ). However, Supplementary Figure S2 and Supplementary Table S6 suggest that discrimination in track recommendation is more pronounced in the middle range of test scores than at the bottom and the top of the performance distribution.
We repeated the main analysis by restricting the sample to those teachers who teach in schools with at least a 1 per cent share of Roma students and, thus, might have an opportunity to discriminate against Roma students in their everyday teaching practices. These schools constitute 83 per cent of the sample. The results are presented in Supplementary Table S7 . For track recommendations, the parameter estimates and effect sizes are slightly smaller in this subsample (-0.049 SD and -0.076 SD of the outcome variable in Models 2 and 5, respectively). Though the estimates are statistically not significant in this restricted sample, the sign and the size of the parameters are very close to those estimated from the entire sample.
The role of the ethnic context
In this section, we explore how the ethnic context affects teacher discrimination. Specifically, we investigate differences between teachers who received tests with fewer Roma than non-Roma names and teachers who received tests with more Roma than non-Roma names. 9
For test scores and grades as outcome variables, the interaction between the Roma dummy and the share of Roma names is not significant (Models 1 and 3 in Table 3 ). At the same time, discrimination in track recommendations is associated with the share of Roma names encountered by the respondents. Table 3 Model 2 shows that teachers who received tests with more Roma names were less likely to discriminate in track recommendations than teachers who received tests with fewer Roma names (fR * Roma: -0.10, P = 0.045). Specifically, the results show that teachers gave similar track recommendations for Roma students in both experimental conditions (the linear combination of the fR + fR * Roma variables: -0.02, P = 0.530). At the same time, they recommended a lower track for non-Roma students if they received tests with more Roma names compared with the condition when they received tests with fewer Roma names (the parameter estimate for the fR variable: 0.07, P = 0.022, the effect size is -0.151 SD of the outcome variable). In the former case, track recommendations for non-Roma students were as low as those for Roma students in both experimental conditions. These findings are graphically presented in Figure 2 , which shows the estimated probabilities of track recommendations for Roma and non-Roma students by the number of Roma names in the experiment.
Discussion
We found mixed evidence for discrimination against Roma students in test evaluations and track recommendations among grade 5-8 teachers in Hungary. In The estimated average treatment effect in track recommendations is substantively small but comparable to the findings of previous experimental studies on discrimination in teacher assessment from other countries. It is not possible to make a direct comparison between our effect sizes and those reported in previous studies on track recommendations due to differences in the dependent variables and analytical strategies employed. However, it is possible to compare our effect sizes on track recommendations with those reported in previous studies on grading. In our case, the estimated effect sizes are between -0.057 and -0.086 SD of the outcome variable, depending on the model specification. Previous studies have found effect sizes between -0.01 and -0.12 SD (van Ewijk, 2011; Hanna and Linden, 2012; Sprietsma, 2013; Wenz and Hoenig, 2020) . Thus, our effect size lies in the middle of the distribution of effect sizes identified in other studies.
Our finding that teachers do not discriminate against Roma students in test evaluations but do so in track recommendations is in line with previous experiments conducted among preservice teachers studying at Hungarian universities (Takács, 2018; Bruneau et al., 2020) and among in-service teachers in other countries (van Ewijk, 2011; Wenz and Hoenig, 2020) . Although track recommendation is not binding in Hungary, families take into account teachers' advice in their secondary school choices (Suhajda, 2017) . Discrimination in track recommendations might exacerbate educational inequalities, especially in highly stratified educational systems where track placement substantially determines educational and labour market outcomes (Buchmann and Park, 2009) .
Exploratory analyses showed that discrimination in track recommendation is more pronounced in the middle range of the performance distribution than at the bottom and the top of the distribution. These findings align with those of Wenz and Hoenig (2020) , who observed discrimination in the case of an average-quality essay but did not find discrimination in the case of a low-quality essay. We contributed to this line of research by showing that discrimination does not occur at the upper end of the performance distribution either. This may also elucidate discrepancies in the findings of different experimental studies, whereby the presence or absence of evidence for discrimination may Notes: Robust standard errors clustered at the teacher level in parentheses. Fewer Roma names is an indicator variable for test packages with two Roma and four non-Roma names (vs. four Roma and two non-Roma names). The analysis is restricted to the three mathematics and three grammar tests where Roma and non-Roma names occur in both experimental conditions. *P < 0.05.
be contingent on the quality of the tests employed the experiment. Further exploratory analyses showed that contextual factors play a substantial role in discrimination in track recommendations. On the one hand, we found that teachers who received tests with fewer Roma than non-Roma names tended to discriminate against Roma students in track recommendations. For this subsample, the effect size is considerably larger than for the entire sample (-0.151 SD of the outcome variable). On the other hand, teachers who received tests with more Roma than non-Roma names did not discriminate. Sprietsma (2013) found similar evidence for a moderating effect of ethnic composition: in Germany, teacher discrimination was smaller if teachers were presented with a higher number of ethnic-sounding names. In contrast, van Ewijk (2011) did not find such a moderating effect in the Netherlands.
We also investigated whether the ethnic composition of the names moderated the effect of the individual names by influencing track recommendations for Roma students, non-Roma students, or both. We found that teachers who encountered more Roma in the experiment gave lower recommendations for non-Roma students than teachers who encountered fewer Roma names, whereas track recommendations for Roma students were relatively similar in the two experimental conditions. Therefore, when teachers received tests with more Roma names, Roma and non-Roma students received similarly low track recommendations. This finding is consistent with a recent vignette experiment (Keller, 2024) , which showed that in classrooms with a higher share of Roma students, teachers evaluated the performance of both Roma and non-Roma students as lower than in classrooms with a lower share of Roma students.
One possible explanation for this finding is that a higher number of ethnic-sounding names have triggered perceptions of segregated schools: teachers interpreted the composition of the minority and majority names on the tests as signals of the ethnic composition of the schools these students presumably came from. Teachers might have relied on their prior knowledge that students in segregated schools have lower school performance on average and recommended a lower track for both Roma and non-Roma students who were associated with a segregated environment. This interpretation is consistent with the everyday experiences of teachers. In Hungary, segregated schools provide lower-quality education (Havas and Liskó, 2005) , and in segregated schools, academic achievement is similar between Roma and non-Roma students (Kertesi and Kézdi, 2011, 2016) .
It is important to note, however, that Roma students did not receive higher track recommendations in test packages with more non-Roma names relative to test
DISCRIMINATION IN TEACHER ASSESSMENTS AND TRACK RECOMMENDATIONS
packages with fewer non-Roma names. This that Roma ethnicity is a strong for teachers, and its negative effect is not moderated by the ethnic composition of the school (e.g., the share of Roma students in the test package). Another possible explanation for the moderating effect of the ethnic composition of the names could be social desirability bias: perhaps teachers who received tests with more Roma names realized the true aim of our study, whereas teachers who received tests with fewer Roma names did not. However, in this scenario, we would expect that social desirability bias generates higher track recommendations for Roma students when the share of Roma names is high compared with recommendations for Roma students in the other case. Put differently, in the case of social desirability bias, track for non-Roma students should be stable, whereas track recommendations for Roma students should vary across the two experimental conditions. We find the opposite pattern; therefore, social desirability bias is not a likely explanation for our results.
Another alternative explanation that we cannot completely rule out is the minority hypothesis, which states that students who represent the numerical minority in a class are more salient and, therefore, attract more attention from teachers than students who represent the numerical majority (Ready and Wright, 2011; Kaiser, Südkamp and Möller, 2017) . This increased attention could lead to more accurate teacher judgements for students in the minority position compared with students in the majority position (Kaiser, Südkamp and Möller, 2017) . In our experimental design, we cannot observe how accurate teachers' evaluations are because we do not have an exact measure of student ability against which to compare the teacher-assigned test scores. We only observe the test scores that teachers assign to the students, which are endogenous. This is also true for track recommendation, as there is no exact cut-off above which students could be recommended to the higher school tracks. In sum, we cannot test whether the minority hypothesis played a role in our experimental setting.
Our findings are consistent with stereotype-based theories of discrimination for three main reasons. First, discrimination in track recommendations but not in test evaluations is more in line with stereotype-based than with taste-based discrimination (Wenz and Hoenig, 2020) . For evaluating the tests, all information was available to the teachers. However, when recommending a secondary school track, teachers usually take into account several other factors besides students' academic achievement (Boone and Van Houtte, 2013) . Qualitative feedback from the participating teachers also supports this: some teachers explicitly wrote that they were missing additional information about the individual students to give a proper track recommendation. Since students' efforts, parental support, and family background were not observable in the experimental situation, teachers needed to rely on their prior beliefs or stereotypes when recommending a track.
Second, the pattern of discrimination in track recommendations depending on the ethnic context provides further indirect evidence for stereotype-based discrimination. It suggests that instead of relying on ethnic prejudices, teachers were actively looking for additional information about the characteristics of students they needed to recommend a track. School composition provides additional information for teachers and can be used as an anchor for beliefs on individual student characteristics.
Third, the fact that discrimination occurred in the middle range of the performance distribution contradicts a simple model of taste-based discrimination and is more in line with social psychological theories of stereotype-based discrimination (Fiske and Neuberg, 1990; Fiske, 1998) . In the case of a large deviation from the stereotypical performance level (very high performance for Roma students and very low performance for non-Roma students), teachers might have updated their category-based judgement with an assessment based on individual performance. In contrast, essays of average quality might not have been striking enough to move teachers' judgements from a category-based response, and therefore, teachers might have relied on their group-specific stereotypes (for a similar argument, see Wenz and Hoenig, 2020) .
Experimental tests of discrimination have their limitations. First, teachers know that their test evaluations and track recommendations do not affect real students. Because of the low-stakes nature of the task, they may behave differently than they do in school. They might be less careful or pay less attention to students' background characteristics because they are told they are supposed to focus on grading. These design elements suggest the possibility of underestimating discrimination.
It is important to note that the external validity of identifying discrimination in an experimental situation is limited. The fact that teachers rely on stereotypes in the experiment does not imply that they discriminate in real-world track recommendations; as in the latter case, they do have further information on their individual students that is missing in the experiment. The results are still relevant, though. First, if teachers have strong prior beliefs on the typical characteristics of certain groups, they might discriminate in track recommendations despite the availability of individual information. Second, the experimental design closely resembles those situations where teachers need to evaluate students they meet, such as high school admission exams. Therefore, results might signal discrimination in high school admission decisions where teachers act as gatekeepers and might not admit talented minority students to knowledge-intensive secondary tracks.
Another limitation of our study is that our experimental design did not distinguish between ethnic discrimination and discrimination based on socioeconomic status (Wenz and Hoenig, 2020; Crabtree et al., 2022) . Previous research suggests that ethnic minority students are discriminated against not only because of their ethnic minority status, but also because of their low social status (Berényi, Berkovits and Erőss, 2008; Wenz and Hoenig, 2020; Kisfalusi, Janky and Takács, 2021) . Our estimate thus captures discrimination against Roma students without disentangling how much of it is due to ethnic discrimination and how much to discrimination based on socioeconomic status. However, due to the very strong correlation between ethnicity and socioeconomic status, Roma students are likely to face the combined effect of these two types of discrimination.
Our findings have important methodological implications for the design of discrimination experiments. Our results show that if participants encounter more than one name, they only rely on the characteristics of the sole individuals but also consider the composition of all individuals in the experiment. As in our study, these results stem from exploratory analyses; future experimental studies should confirm these findings by conducting a pre-registered analysis. Nevertheless, our results suggest that experimental studies need to take into account that the social context participants infer from the experimental design might influence their discriminatory tendencies. Pre-registered experiments are also needed to confirm our findings on the moderating effect of test quality. Furthermore, future studies could design experiments to explicitly test the theoretical mechanism underlying discrimination.
By providing experimental evidence of discrimination in track recommendations among in-service teachers in Hungary, our study highlighted an important mechanism that can exacerbate educational inequalities. In this regard, our findings suggest that Roma students receive lower track recommendations, even if their performance is identical to that of non-Roma students. Raising teachers' awareness of potential biases and their consequences in student evaluation and track recommendations might reduce discrimination against minority students (Alesina et al., 2024) .
Notes
1. In Hungary, teachers are responsible for compiling the tests their students write. Given that most mathematics and grammar teachers only teach in one school, it is very unlikely that students in different schools will write the same tests in a given subject. Based on this institutional feature, it is reasonable to assume that if teachers are provided with different solutions for the same test, they will assume that the tests originate from the same student community. 2. Based on our power calculations using the Optimal Design program (Spybrook et al., 2011) , we aimed to have at least 400 respondents in the experiment, each grading 6 tests. With 400 respondents, our design's minimum detectable effect size was 0.11. Since prior research found an effect size of 0.12 comparing Turkish and native German (Sprietsma, 2013; Wenz and Hoenig, 2020) , our design was well powered to detect a similar-sized treatment effect among Roma and non-Roma Hungarians. 3. Compared to the pre-registered analysis plan, we use reversed coding, which is more intuitive concerning our substantive research question. 4. We deviate from our pre-registered models as we cluster standard errors at the teacher level (instead of the school level). Based on the paper by Abadie et al. (2023) , clustering is more appropriate at the teacher level in our case. Our two pre-registered models are also presented with standard errors clustered at the school level in Supplementary Table S4 .
Clustering of the standard errors at the teacher level instead of the school level does not substantially alter our results. 5. Since the secondary analyses have exploratory purposes, we do not test pre-registered hypotheses. Therefore, we report two-sided t-tests in the tables. 6. The share of Roma students in the school stems from administrative data merged with our experimental data. It is estimated by principals in the school survey attached to the National Assessment of Basic Competences, which is a standardized yearly test measuring reading literacy and mathematics skills for the full population of sixth-, eighth-, and tenth-grade students in the country. 7. As the use of the teacher fixed effects holds the test evaluation/track recommendation for non-Roma students as constant across the two experimental conditions (fewer vs. more Roma names), it is not possible to estimate the difference in the test evaluation/track recommendation for non-Roma students by the two experimental conditions. In order to be able to estimate the effect of the share of Roma names separately for both Roma and non-Roma students, we have removed the teacher fixed effects from this estimation. 8. Although we did not have a pre-registered hypothesis on gender discrimination, it should also be noted that teachers tended to discriminate against female students in test evaluations. The finding that teachers discriminated against female students suggests that the absence of discrimination against Roma students in test evaluations is not because the tests we used did not provide enough room for subjective evaluation in teacher assessments. 9. It is important to note that half of the tests were paired with only Roma or only non-Roma names if we split the sample according to the composition of names in the experiment (e.g., test B was paired with only Roma names in the fewer Roma names condition and with only non-Roma names in the more Roma names condition, see Supplementary Table S3 ). Therefore, this analysis is restricted to the three
Figure 1
1Figure 2
2Table 1
1| Non-Roma | Roma | Female | Male | Total | ||||||
| Mean | SD | Mean | SD | Mean | SD | Mean | SD | Mean | SD | |
| Test score | 18.90 | 5.28 | 18.73 | 5.40 | 18.58 | 5.35 | 19.05 | 5.32 | 18.82 | 5.34 |
| Grade | 3.09 | 0.99 | 3.08 | 1.03 | 3.06 | 1.01 | 3.11 | 1.01 | 3.09 | 1.01 |
| Recommendation | 0.70 | 0.46 | 0.66 | 0.47 | 0.67 | 0.47 | 0.70 | 0.46 | 0.68 | 0.47 |
| N | 1234 | 1244 | 1239 | 1239 | 2478 |
Table 2
2| (1) | (2) | (3) | (4) | (5) | (6) | ||
| Test score | Track recommendation | Grade | Test score | Track recommendation | Grade | ||
| Roma | -0.011 | -0.026* | 0.018 | -0.164 | -0.040* | -0.013 | |
| SE P-value a Male SE P-value Test FE Teacher FE Constant Observations R-squared Number of teachers | (0.121) 0.465 0.327*** (0.092) <0.001 Yes Yes Yes 2,478 0.781 413 | (0.015) 0.042 0.016 (0.013) 0.226 Yes Yes Yes 2,478 0.425 413 | (0.024) 0.232 0.035 (0.021) 0.101 Yes Yes Yes 2,478 0.683 413 | (0.206) 0.213 0.464** (0.136) 0.001 No No Yes 2,478 0.002 413 | (0.019) 0.019 0.028 (0.016) 0.080 No No Yes 2,478 0.003 413 | (0.038) 0.369 0.057* (0.027) 0.033 No No Yes 2,478 0.001 413 | Downloaded from https://academic.oup.com/esr/advance-article/doi/10.1093/esr/jcae044/7902057 by guest on 17 November 2024 |
Table 3
3| (1) | (2) | (3) | |
| Test score | Track recommendation | Grade | |
| Roma | 0.149 | 0.026 | 0.056 |
| SE | (0.187) | (0.031) | (0.043) |
| P-value | 0.425 | 0.403 | 0.200 |
| Male | 0.039 | 0.027 | -0.057 |
| SE | (0.212) | (0.022) | (0.050) |
| P-value | 0.853 | 0.228 | 0.255 |
| Fewer Roma names | 0.203 | 0.074* | 0.040 |
| SE | (0.236) | (0.032) | (0.057) |
| P-value | 0.390 | 0.022 | 0.490 |
| Fewer Roma names * Roma | -0.103 | -0.095* | -0.021 |
| SE | (0.254) | (0.045) | (0.061) |
| P-value | 0.685 | 0.036 | 0.729 |
| Test FE | Yes | Yes | Yes |
| Teacher FE | No | No | No |
| Constant | Yes | Yes | Yes |
| Observations | 1,239 | 1,239 | 1,239 |
| R-squared | 0.804 | 0.459 | 0.685 |
| Number of teachers | 413 | 413 | 413 |
References
- When should you adjust standard errors for clustering? A Abadie The Quarterly Journal of Economics 138 2023
- Statistical Theories of Discrimination in Labor Markets Dennis J Aigner Glen G Cain 10.1177/001979397703000204 ILR Review ILR Review 0019-7939 2162-271X 30 2 1977 SAGE Publications
- Revealing stereotypes: evidence from immigrants in schools A Alesina American Economic Review 114 2024
- THE THEORY OF DISCRIMINATION Kenneth J Arrow 10.1515/9781400867066-003 Discrimination in Labor Markets A Rees Princeton, NJ Princeton University Press 1973
- The Economics of Discrimination G S Becker 1957. 14 April 2020 The University of Chicago Press
- Controlling the false discovery rate: a practical and powerful approach to multiple testing author Y Benjamini Y Hochberg Journal of the Royal Statistical Society Series B: Statistical Methodology 57 1995
- Iskolarendszer és szabad választás. A jóindulatú szegregációról E Berényi B Berkovits G Erőss Iskolarend. Kiváltság És Különbségtétel a Közoktatásban E Berényi Budapest Gondolat 2008
- School segregation, educational attainment, and crime: evidence from the end of busing in Charlotte-Mecklenburg S B Billings D J Deming J Rockoff The Quarterly Journal of Economics 129 2014
- Toward a Theory of Minority-Group Relations H M Blalock 1967 Wiley New York
- Measuring Racial Discrimination Blank, R. M. et al. 2004 The National Academies Press Washington, DC
- Race prejudice as a sense of group position H Blumer The Pacific Sociological Review 1 1958
- Inaccurate statistical discrimination: an identification problem J A Bohren The Review of Economics and Statistics 2023
- Why are teacher recommendations at the transition from primary to secondary education socially biased? A mixed-methods research S Boone M Van Houtte British Journal of Sociology of Education 34 2013
- Migrant achievement penalties in Western Europe: Do educational systems matter? C Borgna D Contini European Sociological Review 30 2014
- Old habits die hard? School guidance interventions and the persistence of inequalities C Borgna Research in Social Stratification and Mobility 81 100728 2022
- Racial discrimination in grading: evidence from Brazil F Botelho R A Madeira M A Rangel American Economic Journal: Applied Economics 7 2015
- Beyond dislike: blatant dehumanization predicts teacher discrimination E Bruneau Group Processes & Intergroup Relations 23 2020
- The effect of immigration on the performance of natives: cross country evidence using PISA test scores G Brunello L Economics of Education Review 32 2013
- Stratification and the formation of expectations in highly differentiated educational systems C Buchmann H Park Research in Social Stratification and Mobility 27 2009
- Test scores, subjective assessment, and stereotyping of ethnic minorities S Burgess E Greaves Journal of Labor Economics 31 2013
- Biased grades? Changes in grading after a blinding of examinations reform M Bygren Assessment & Evaluation in Higher Education 45 2020
- The role of academic achievement growth in school track recommendations D H Caro Studies in Educational Evaluation 35 2009
- Racially distinctive names signal both race/ethnicity and social class C Crabtree Sociological Science 9 2022
- Economic and sociological views of discrimination in labor markets: persistence or demise? P England P Lewin Sociological Spectrum 9 1989
- Aspiration squeeze: the struggle of children to positively selected immigrants P Engzell Sociology of Education 92 2019
- Stereotyping, prejudice, and discrimination S T Fiske The Handbook of Social Psychology D Gilbert New York McGraw-Hill 1998
- A continuum of impression formation, from category-based to individuating processes: influences of information and motivation on attention and interpretation S T Fiske S L Neuberg Advances in Experimental Social Psychology M P Zanna Amsterdam, NL Academic Press 1990. 19 March 2020 23
- Inequality in Teacher Judgements, Expectations and Track Recommendations: A Review Study S Geven A Batruch H Van De Werfhorst 2018 Amsterdam: Universiteit van Amsterdam
- Teachers' attitudes towards ethnic minority students: effects of schools' cultural diversity S Glock C Kovacs I Pit-Ten Cate The British Journal of Educational Psychology 89 2019
- T Hajdu Z Hermann D Horn H Hőnich J Varga A Közoktatás Indikátorrendszere 2021 2022 The Indicator System of the Hungarian Public Education 2021
- Krtk Kti July 2022 Budapest
- Roma fiatalok a középiskolában : Beszámoló a TÁRKI Életpálya-felmérésének 2006 és 2012 közötti hullámaiból Tamás Hajdu Gábor Kertesi Gábor Kézdi 10.61501/trip.2014.12 Társadalmi Riport T Kolosi Budapest TÁRKI Társadalomkutatási Intézet Zrt. 2014. 2014
- Inter-ethnic friendship and hostility between Roma and non-Roma students in Hungary: the role of exposure and academic achievement T Hajdu G Kertesi G Kézdi The B.E. Journal of Economic Analysis & Policy 19 2019
- Discrimination in grading R N Hanna L L Linden American Economic Journal: Economic Policy 4 2012
- Szegregáció a Roma Tanulók Általános Iskolai Oktatásában [Segregation in the Education of Roma Primary School Students] G Havas I Liskó Budapest: Felsőoktatási Kutatóintézet 2005. July 2022
- Guest editorial: explaining ethnic inequalities in educational attainment A Heath Y Brinbaum Ethnicities 7 2007
- Detecting discrimination J J Heckman Journal of Economic Perspectives 12 1998
- School segregation, student achievement, and educational attainment in Hungary Z Hermann D Kisfalusi International Journal of Comparative Sociology 2023. 00207152231198434
- Weak performance--strong determination: school achievement and educational choice among children of immigrants in Sweden J O Jonsson F Rudolphi European Sociological Review 27 2011
- The unbearable accuracy of stereotypes L Jussim Handbook of Prejudice, Stereotyping, and Discrimination T D Nelson New York, NY Psychology Press 2009
- The effects of student characteristics on teachers’ judgment accuracy: Disentangling ethnicity, minority status, and achievement. Johanna Kaiser Anna Südkamp Jens Möller 10.1037/edu0000156 Journal of Educational Psychology Journal of Educational Psychology 0022-0663 1939-2176 109 6 2017 American Psychological Association (APA)
- Teachers' perceptions of students' school performance: the impact of classroom composition. Evidence from a survey experiment T Keller Education Economics 2024
- Yes, you can! Effects of transparent admission standards on high school track choice: a randomized field experiment T Keller K Takács F Elwert Social Forces 101 2022
- Roma population of Hungary 1971-2003 I Kemény B Janky Roma of Hungary. East European Monographs I Kemény 2006 Columbia University Press New York
- Roma és nem roma fiatalok középiskolai továbbtanulása G Kertesi G Kézdi Oktatás És Foglalkoztatás K Fazekas Budapest KTI 2009 12
- The Roma/non-Roma test score gap in Hungary G Kertesi G Kézdi American Economic Review 101 2011
- Ethnic segregation between Hungarian schools: long-run trends and geographic distribution G Kertesi G Kézdi Hungarian Statistical Review 90 2012
- On the test score gap between Roma and non-Roma students in Hungary and its potential causes G Kertesi G Kézdi Economics of Transition 24 2016
- Roma students' academic self-assessment and educational aspirations in Hungarian primary schools D Kisfalusi British Journal of Sociology of Education 44 2023
- Grading in Hungarian primary schools: mechanisms of ethnic discrimination DISCRIMINATION IN TEACHER ASSESSMENTS AND TRACK RECOMMENDATIONS against Roma students D Kisfalusi B Janky K Takács European Review 37 2021
- immigrants and girls graded worse? Results of a matching approach D Kiss Oktatási Adatok, 2021/2022 (Előzetes Adatok), available from KSH 2013. 2022. May 2022 21
- Comparing Teachers' Assessments and National Test Results-Evidence from Sweden E Lindahl Working Paper Series 24 2007. 2007. February 2017 Uppsala, Sweden IFAU-Institute for Evaluation of Labour Market and Education Policy
- Subtle discrimination: Do stereotypes among teachers trigger bias in their expectations and widen ethnic achievement gaps? G Lorenz Social Psychology of Education 24 2021
- The big-fish-little-pond effect on academic self-concept. Herbert W Marsh 10.1037/0022-0663.79.3.280 Journal of Educational Psychology Journal of Educational Psychology 0022-0663 1939-2176 79 3 1987 American Psychological Association (APA)
- Teacher expectations, classroom context, and the achievement gap C Mckown R S Weinstein Journal of School Psychology 46 2008
- Mind the overlap in multiple categorization: a review of crossed categorization, intersectionality, and multiracial perception G Nicolas M De La Fuente S T Fiske Group Processes & Intergroup Relations 20 2017
- Minority Rights Protection in International Law: The Roma of Europe H O'nions 2016 Routledge London, UK
- Assessed by a Teacher Like Me: Race, Gender and Subjective Evaluations. London, UK: Centre for the Economics of Education A Ouazad 2008. February 2017
- The Sociology of Discrimination: Racial Discrimination in Employment, Housing, Credit, and Consumer Markets Devah Pager Hana Shepherd 10.1146/annurev.soc.33.040406.131740 Annual Review of Sociology Annu. Rev. Sociol. 0360-0572 1545-2115 34 1 2008 Annual Reviews
- The statistical theory of racism and sexism E Phelps The American Economic Review 62 1972
- Prejudice as a response to perceived group threat: population composition and anti-immigrant and racial prejudice in Europe L Quillian American Sociological Review 60 1995
- New approaches to understanding racial prejudice and discrimination L Quillian Annual Review of Sociology 32 2006
- Accuracy and inaccuracy in teachers' perceptions of young children's cognitive abilities: the role of child background and classroom context D D Ready D L Wright American Educational Research Journal 48 2011
- Why do immigrant students aim high? Explaining the aspiration-achievement paradox of immigrants in Germany Z Salikutluk European Sociological Review 32 2016
- Sociological perspectives on racial discrimination M L Small D Pager Journal of Economic Perspectives 34 2020
- Discrimination in grading: experimental evidence from primary school teachers M Sprietsma Empirical Economics 45 2013
- Optimal design plus empirical evidence: Documentation for the J Spybrook H Bloom R Congdon C Hill A Martinez S Raudenbush A To Optimal Design" software 2011. February 2020
- Referenzgruppeneffekte im Simulierten Klassenraum A Südkamp J Möller Zeitschrift für Pädagogische Psychologie 23 2009
- A Pályaorientációs Tevékenység Változása És Megvalósulása a Köznevelésben a Rendszerváltozástól Napjainkig, Különös Tekintettel Az Információs Folyamatokra C J Suhajda 2017 Pécsi Tudományegyetem Pécs
- Social psychology of intergroup relations H Tajfel Annual Review of Psychology 33 1982
- An integrative theory of intergroup conflict H Tajfel J Turner Austin, W. G. and Worchel, S. 1979 The Social Psychology of Intergroup Relations. Monterey: Brooks/Cole
- Személynév-etnikai sztereotípia-előítélet J Takács Névtani Értesítő 40 2018
- Track recommendation bias: gender, migration background and SES bias over a 20-year period in the Dutch context A C Timmermans British Educational Research Journal 44 2018
- Are children of immigrants graded less generously by their teachers than natives, and why? Evidence from student population data in Italy M Triventi International Migration Review 54 2020
- Social comparison and social identity: some prospects for intergroup behaviour J C Turner European Journal of Social Psychology 5 1975
- Same work, lower grade? Student ethnicity and teachers’ subjective assessments Reyn Van Ewijk 10.1016/j.econedurev.2011.05.008 Economics of Education Review Economics of Education Review 0272-7757 30 5 2011 Elsevier BV
- Ethnic and social class discrimination in education: experimental evidence from Germany S E Wenz K Hoenig Research in Social Stratification and Mobility 65 100461 2020
- Discrimination in grading: A scoping review of studies on teachers’ discrimination in school Giulietta Zanga 0000-0002-4902-8392 Elena De Gioannis 0000-0003-4341-3166 10.1016/j.stueduc.2023.101284 Studies in Educational Evaluation Studies in Educational Evaluation 0191-491X 78 101284 2023 Elsevier BV
Metadata
Issues
No public issues have been filed for this DOI.
Submit an issue
Record history
| When | Event | Field | Old | New |
|---|---|---|---|---|
| 2026-06-18 19:37:53.011249+00:00 | identifier_assigned | DSEID | DSEID-001-4372361 | |
| 2026-06-18 15:20:12.157870+00:00 | pdf_processed | pdf_sha256 | a021dcb3c182f20b5d8c0d0d57f8c35350c7785ace681a70ac4e425f51b598ec |