Gender Difference in Mathematics Achievement of Total, Low-, and High-Achieving Students: Evidence from East Asian Countries’ TIMSS 2015 Mathematics Achievement

Kim, Meereem; Kwak, Minho

doi:10.29221/jce.2018.21.4.99

J. Curric. Eval. 2018; 21(4):99-124

pISSN: 1229-1544

DOI: https://doi.org/10.29221/jce.2018.21.4.99

교육평가

Gender Difference in Mathematics Achievement of Total, Low-, and High-Achieving Students: Evidence from East Asian Countries’ TIMSS 2015 Mathematics Achievement

김미림¹^,^*, 곽민호²^,^**

전체 및 저성취, 고성취 학생들의 수학 성취도 성차 분석: 동아시아권 국가의 TIMSS 2015 수학 성취도를 중심으로

Meereem Kim¹^,^*, Minho Kwak²^,^**

Author Information & Copyright ▼

¹한국교육개발원 연구원

²조지아대학교 박사과정

¹Researcher, Korean Educational Development Institute

²Doctoral Student, University of Georgia

^*First author, mrkim@kedi.re.kr

^**Corresponding author, minho.kwak25@uga.edu

© Copyright 2018, Korea Institute for Curriculum and Evaluation. This is an Open-Access article distributed under the terms of the Creative Commons Attribution NonCommercial-ShareAlike License (http://creativecommons.org/licenses/by-nc-sa/4.0) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.

Received: Oct 04, 2018 ; Revised: Nov 21, 2018 ; Accepted: Nov 22, 2018

Published Online: Nov 30, 2018

<Abstract>

Gender difference in mathematics achievement has been one of the most controversial issues in the academic area of mathematics over the last few decades. Recent studies suggested that the gender gap tends to decrease over the years, but the difference is much more apparent in higher mathematical ability levels. This paper examined on the gender gap in TIMSS 2015 mathematics achievement of total, low-, and high-achieving students using mutlilevel modeling. Also, this study focused on the gender gap in East Asian countries which outperformed in TIMSS 2015. The results suggest that the gender gap can vary depending on the level of mathematics achievement. For total students, the gender gap was detected in all of the countries except Hong Kong SAR. For low-achieving students, the gender difference was significant in Chinese Taipei, Hong Kong SAR, and Singapore. For high-achieving students, the gender gap was not present in all countries, which suggests that the gender gap in these countries differs from that of the Western area. This study showed that the gender gap in mathematics achievement of East Asian countries could be different according to mathematical ability. Also, this study demonstrated the impact of controlling influential variables on mathematics achievement in studying gender differences by suggesting that the results of the simple comparison of the mean achievement scores were different from the results obtained from the multilevel model controlling important variables on mathematics achievement.

국문요약

성별에 따른 수학 성취도 차이는 수십 년간 학계에서 논란의 중심이 되어왔다. 최근 연구에 따르면 남학생과 여학생의 수학 성취도 차이는 전반적으로는 점차 감소해왔으나, 수학 성취도가 높은 학생들의 경우 그 차이가 확연히 드러나는 양상을 취하는 것으로 나타났다. 이 연구에서는 TIMSS 2015 자료를 활용하여 전체 학생 및 저성취, 고성취 학생을 대상으로 성별에 따른 수학 성취도 차이를 분석하였다. 또한, 수학 성취도와 관련 있는 학교 및 학생 변인을 통제하기 위해 다층모형을 사용하였다. 분석 대상 국가는 TIMSS 2015 수학 성취도 평가에서 최상위권을 차지한 동아시아권의 5개 국가(대만, 홍콩, 일본, 한국, 싱가포르)로 한정하였다. 연구 결과, 동아시아권 국가의 수학 성취도 성차는 성취도 수준에 따라 다르게 나타났다. 구체적으로, 전체 학생을 대상으로 했을 때 홍콩을 제외한 4개국에서 유의한 성차가 나타났다. 저성취 학생의 경우 대만, 홍콩, 싱가포르에서 유의한 성차가 나타났다. 고성취 학생의 경우 어느 나라에서도 유의한 성차가 나타나지 않았으며, 이는 동아시아권 국가의 수학 성취도 성차가 서구권 국가의 수학 성취도 성차와는 다른 양상을 나타낸다는 점을 시사한다. 이 연구는 동아시아권 국가의 수학 성취도 성차가 수학 능력 수준에 따라 달리 나타날 수 있음을 보였다. 또한, 단순 평균을 비교한 결과와 다층모형에서 얻어진 결과의 차이점을 제시함으로써 성차를 연구함에 있어 주요 변수의 통제가 미치는 영향을 실증하였다.

Keywords: Mathematics Achievement; Gender Gap; TIMSS 2015; Multilevel Modeling

Keywords: 수학 성취도; 성별 차이; TIMSS 2015; 다층모형

I. Introduction

The gender gap in mathematics achievement has been one of the most controversial issues in the field of mathematics over the last few decades. One reason to be concerned about differences in mathematics achievement between female and male students is that mathematics achievement can be used as a promising predictor of future income (Murnane, Willett, & Levy, 1995; Niederle & Vesterlund, 2010); furthermore, Mitra (2002) found that higher income tends to be guaranteed for women with above-average mathematical skills. The author also found that the wage differences between females and males were significant across all educational levels, but this difference disappeared for those females and males who have higher mathematical skills.

Most commonly, the gender difference between females and males has decreased over the years, but the difference becomes much more apparent at higher levels of schooling (Cheema & Galluzzo, 2013; Cimpian, Lubienski, Timmer, Makowski, & Miller, 2016; Friedman, 1989; Ganley & Lubienski, 2016; Guiso, Monte, Sapienza, & Zingales, 2008; Hyde, Lindberg, Linn, Ellis, & Williams, 2008). For example, Cimpian et al. (2016) used Early Childhood Longitudinal Study-Kindergarten Class of 1998-1999 and 2010-2011 and found out that the gender gap emerged early only among higher achievers, but this tendency spread out throughout the others as the grade increased.

Some studies have focused on the evident gender gap in the right tail of mathematical ability (Benbow & Stanley, 1980, 1983; Ellison & Swanson, 2010; Fryer & Levitt, 2010; Robinson & Lubienski, 2011; Wai, Putallaz, & Makel, 2010). Benbow & Stanley (1980, 1983) reported the male-female ratio in the extreme right tail was 13 to 1. Similarly, Ellison & Swanson (2010) showed that the male-female ratio at the 99th percentile exceeded 10 to 1. In contrast, Wai et al. (2010) showed that the gender gap in the right tail of mathematical ability still exists, but the ratio has declined. This body of research indicates that the gender gap should be explored depending on the different levels of mathematical ability.

Caution is required, however, to interpret this gender gap in mathematics achievement since this gap may differ depending on the statistical methodology used. For example, Cheema and Galluzzo (2013) argued that some of the findings from early studies could be problematic for two reasons. First, they did not consider controlling demographic factors (e.g., race or socioeconomic status) which could affect mathematics achievement. Second, the gender gap could not be generalized to the population when the sample was not nationally representative.

This study, therefore, aims to examine the gender gap in mathematics achievement in the Trends in International Mathematics and Science Study (TIMSS) 2015 according of the level of mathematical ability as well as the gender gap among total students using a two-level multilevel modeling. TIMSS 2015 was implemented followed by suggestions from Cheema and Galluzzo (2013), since it consists of nationally representative samples and provides various background information which can be used as control variables. A two-level multilevel model was used due to a hierarchical structure of the data. In addition, the gender gap among eighth grade students was examined based on the results of the previous studies that gender differences tend to be more apparent at higher levels of schooling. Finally, this study focused on the high-achieving countries in TIMSS 2015 mathematics assessment as good examples, which resulted in selecting East Asian countries, including Chinese Taipei, Hong Kong Special Administrative Region (SAR), Japan, Korea, and Singapore. The performance of East Asian countries on TIMSS 2015 was outstanding for both the fourth and eighth grade students, and the gap between these countries and their Western counterparts such as the United States, England, and Canada was more than 50 points (Mullis, Martin, Foy, & Hooper, 2016).

II. Literature Review

1. Gender Gap in Mathematics

Even though many previous studies noted that the gender gap in mathematics achievement was consistent, especially in higher-level skills, other research still suggests that this gap can differ as a result of the features of the data, such as whether a study is a large-scale assessment or whether there are important control variables such as self-efficacy or anxiety. A large-scale assessment enables researchers to generalize the results to the population level because its huge sample size and control variables which are provided in assessments can clarify gender effects on mathematics achievement by considering other important factors. For example, Cheema and Galluzzo (2013) used data from the Program for International Student Assessment (PISA). One of the advantages of the PISA dataset is that it includes background information on students, and this information can be used as control variables. The results indicated that the difference between female and male students in mathematics achievement disappeared when important predictors of mathematics achievement were considered, such as self-efficacy and anxiety related to mathematics. This finding seems intriguing because it suggests that the gender gap comes from emotional aspects such as the negative self-efficacy or anxiety of female students rather than actual differences in intelligence.

As noted above, there is a tendency for the gender gap in mathematics achievement to decrease as female students’ college preparation increases, but the gap between boys and girls appears to widen among high mathematics achievers. Ellison and Swanson (2010) confirmed this tendency using data from American Mathematics Competitions (AMC), which consisted of a series of mathematics contests for high school students. The results indicated that the male-female ratio at the 99th percentile exceeded 10 to 1. Also, the results showed that the magnitude of the variation in the gender gap across schools in the United States was moderate, which implies that the difference in mathematics achievement between boys and girls looks similar from school to school. Finally, the results indicated that the highest-scoring girls’ schools were extremely limited to super-elite schools, whereas the highest-scoring boys came from various backgrounds. The authors speculated that “girls suffer in becoming high achievers in mathematics because they are more compliant with authority figures and/or are more sensitive to social environment” (p. 126).

Similarly, Fryer and Levitt (2010) found a gender gap in mathematics achievement in the upper tail of the distribution. The authors used data from the Early Childhood Longitudinal Study Kindergarten Cohort (ECLS-K). The sample consisted of more than 20,000 children who entered kindergarten in 1998. The participants were re-interviewed at the end of the first year at kindergarten, first grade, third grade, and fifth grade. The results indicated that the proportion of girls in the top 5 percent changed dramatically from 45 percent on entering kindergarten to 28 percent at the end of the fifth grade. Using various information variables from the data, the authors found that the gender difference in mathematics performance did not appear among the female students whose mothers worked in mathematics-related occupations or whose mothers’ educational levels were higher than those of their fathers. This evidence suggests that role models can have a positive effect on girls’ mathematics achievement.

Little research has been done on the gender gap in mathematics achievement of East Asian students, and the results of the studies indicated that the gender gap of these students differs from those of Western countries (Kane & Mertz, 2012; Tsai, Smith, & Hauser, 2018; Tsui, 2007). For example, Tsui (2007) implemented the F-test to examine the gender gap in mathematics scores between China and the United States, especially for talented students. The results showed that the quality of teachers might have an influence on the gender gap. In this study, the gender differences of the top-level students were unclear, but the results indicated that the gender gap at the general level can be related to the quality of the teachers and teaching methods. Specifically, the level of training and overall understanding of mathematical concepts of Chinese mathematics teachers seems to be higher than that of the average mathematics teachers in the United States. Also, this study reported that having a culturally American background might contribute to the acceptance of gender differences in mathematics. Within the culture, teachers hesitate to encourage high mathematics scores for female students, and this culture is rarely observed in China.

Also, Kane and Mertz (2012) used multiple regression to analyze gender differences in mathematics scores, taking into account the religious, cultural, and class styles of 17 countries. In this study, they used various panel data, including PISA, TIMSS and International Mathematical Olympiad (IMO). Among the results, the results of TIMSS 2007 for the eighth grade students showed that there was little evidence that female students show lower scores than male students’ score among three East Asian countries; Korea, Singapore, and Hong Kong SAR. Specifically, in Korea, female students scored slightly lower than male students. However, in all other countries, female students scored higher than male students.

Tsai, Smith, and Hauser (2018) used PISA 2012 to explore the gender gap between three East Asian countries (i.e., Japan, Korea, and Taiwan) and three Western countries (i.e., USA, Germany, and the Czech Republic). In this study, a multilevel multiple indicators and multiple causes (MIMIC; Jöreskog & Goldberger, 1975) model is applied to identify the gender gap for ninth and tenth grade students. The results indicated that while the gender gap was observed in the Western countries, the gap was not observed in the East Asian countries examined in this study.

In contrast, Meisenberg (2016) showed that gender gap in East Asia might be obvious, which is similar to the other countries. In this study, regression models were applied to explore the cultural impact on the gender gap in the three different achievements (i.e., reading, science, and mathematics) for 15-year-olds in 75 countries using PISA 2000-2012 data sets. The author divided the 75 countries into 8 groups based on the cultural similarities (i.e., Protestant Europe, Catholic Europe/Mediterranean, English-speaking countries, Ex-communist countries, Latin America, North Africa/Middle East, South/Southeast Asia, and East Asia). The results showed that the mean score of the female students was consistently lower than that of male students for mathematics and science. Especially, this study showed that the gender gap related to mathematics in East Asia was not significantly smaller than the gender gap of the other countries.

A few limitations still exist in these previous studies which examined gender differences in East Asian countries. To be more precise, Kane and Mertz (2012) did not consider the possibilities that the gender gap may vary depending on level of achievement. Tsui (2007) focused only on the gender differences among talented students and did not control other factors important to mathematics, whereas both Tsai et al. (2018) and Meisenberg (2016) did not consider levels of the mathematical ability when examining the gender gap. This study, therefore, examines gender differences in the East Asian countries for low- and high-achieving students, as well as for total students. At the same time, important variables related to mathematics achievement are included in the two-level multilevel model.

2. Characteristics of Mathematics Education in East Asian Countries

Strayhorn (2010) suggested that the gap in mathematics achievement can be explained by Bronfenbrenner’s (1979) ecological system theory. Within this framework, the ability of growth is accomplished through the system consisted of the four hierarchical sub-systems: microsystem, mesosystem, exosystem, and macrosystem. Specifically, the most inclusive one called as macrosystem denotes the socio-cultural factors such as historical trends. The exosystem means the policy or faculty curriculum regarding the schools. The mesosystem typically implies the parental involvement. The last one called as the microsystem represents the psychological traits (e.g., self-concept, self-efficacy, and etc.). These four subsystems interact with each other and have an impact on not only the academic achievement itself but also the achievement gap among some different demographic groups.

Also, as discussed in the previous section, the international gender gap studies commonly used large-scale assessment including PISA and TIMSS. In the studies, they generally include various variables (e.g., psychological variable, socio-economic status variables, and school-related variable) related to the achievement, and these variables are well matched with the system referred above. Thus, it might be a useful perspective to draw meaningful characteristics of East Asian mathematics education when the results of the previous studies are summarized at the individual level, the family level, and the school level.

At the individual level, the achievement level of East Asian students is generally very high. Specifically, Ho (2009) found that students in Japan, Korea, Hong Kong, and Macao showed higher mathematics achievement than the OECD average, using PISA 2003 data. Hojo and Oshio (2012) also showed that students from Japan, Korea, Taiwan, Hong Kong, and Singapore obtained high mathematics achievement compared to the other countries. Results regarding the high mathematics achievement of East Asian students have been continuously reported in recent study that analyzes the factors of high PISA achievement among East Asian students (Jerrim, 2015). However, in contrast to their high achievement, these students showed relatively low self-concept and self-efficacy (Ho, 2009; Leung, 2010; Shen, 2007).

At the family level, parents in East Asian countries have a high level of interest in their children’s education and have invested heavily in the achievement of their children (Jerrim, 2015). It is an interesting result because the educational level of East Asian parents is relatively low compared to the level of parents from Western countries (Shen, 2007).

At the school level, the quality of teachers is much higher than those of Western countries (Wößmann, 2005).

However, even in the same East Asian countries, there are some differences. First, there are differences in students’ learning strategies. According to Ho (2009), students’ learning strategies and learning environments in Chinese countries such as Macao and Hongkong are better than those in Japan and Korea. And although the mathematics anxiety of East Asian students is generally high, this anxiety is especially high in Japan and Korea when compared to Chinese countries. Inversely, self-concept was lower in both Korea and Japan than in the two Chinese countries. There was also a difference in intrinsic motivation, which showed that Chinese students had higher intrinsic motivation. It is also found that the students in Singapore have more positive attitudes toward mathematics and students’ self-concepts than those in the other East Asian countries (Leung, 2010). Also, Hojo and Oshio (2012) presented different results regarding the scores. According to the results of this study, Japanese students’ mathematics scores were more uniform than those of China, Korea, Taiwan, Hong Kong, and Singapore.

III. Methods

1. Data Sources

The data was obtained from the TIMSS 2015. TIMSS is “a comparative assessment of the achievement of students in many countries” (Glynn, 2012, p. 1321) and consists of mathematics and science assessments for fourth and eighth grade students.

2. Measures

<Table III-1> and <Table III-2> present the student- and school-level variables used in this study, respectively. The outcome measures investigated were five plausible values of mathematics achievement provided in TIMSS 2015. In the TIMSS achievement tests, a subset of items is used to estimate a student’s ability, as it is almost impossible for students to answer all TIMSS achievement items (e.g., 211 mathematics items for eighth grade) in the limited testing time. Instead of calculating a point estimation for a student’s ability, the plausible value methodology (Mislevy, 1991) estimates the ability distribution from the responses to the subset of items and draws a fixed number of individual estimates called the plausible values randomly from the estimated ability distribution. These five random numbers reflect the uncertainty of the student’s proficiency in mathematics (Foy & Yin, 2016).

Table III-1. Student-Level Variables

Varible	Description
mathematics Achievement	Five plausible values (BSMMAT01, BSMMAT02, BSMMAT03, BSMMAT04, BSMMAT05)
Gender	1=male, 0=female
Science Achievement	1st plausible value (BSSSCI01)
Home Educational Resources	- Number of books in the home - Number of home study supports - Highest level of education of either parent
Like Learning Mathematics	- I enjoy learning mathematics - I wish I did not have to study mathematics^{- Mathematics is boring^{- I learn many interesting things in mathematics - I like mathematics - I like any schoolwork that involves numbers - I like to solve mathematics problems - I look forward to mathematics class - Mathematics is one of my favorite subjects}}
Student’s view on Engaging Teaching in Mathematics Lessons	- I know what my teacher expects me to do - My teacher is easy to understand - I am interested in what my teacher says - My teacher gives me interesting things to do - My teacher has clear answers to my questions - My teacher is good at explaining mathematics - My teacher lets me show what I have learned - My teacher does a variety of things to help us learn - My teacher tells me how to do better when I make a mistake - My teacher listens to what I have to say
Confidence in Mathematics	- I usually do well in mathematics - Mathematics is more difficult for me than for many of my classmates^{- Mathematics is not one of my strengths^{- I learn things quickly in mathematics - Mathematics makes me nervous^{- I am good at working out difficult mathematics problems - My teacher tells me I am good at mathematics - Mathematics is harder for me than any other subject^{- Mathematics makes me confused}}}}
Value Mathematics	I think learning mathematics will help me in my daily life - I need mathematics to learn other school subjects - I need to do well in mathematics to get into the university of my choice - I need to do well in mathematics to get the job I want - I would like a job that involves using mathematics - It is important to learn about mathematics to get ahead in the world - Learning mathematics will give me more job opportunities when I am an adult - My parents think that it is important that I do well in mathematics - It is important to do well in mathematics

^* Note. Reverse coded

Download Excel Table

Table III-2. School-Level Variables

Varible	Description
Emphasis on Academic Success	- Teachers’ understanding of the school’s curricular goals - Teachers’ degree of success in implementing the schools’ curricular - Teachers’ expectations for student achievement - Teachers’ ability to inspire students - Parental involvement in school activities in school activities - Parental commitment to ensure that students are ready to learn - Parental support for student achievement - Students’ desire to do well in school - Students’ ability to reach school’s academic goals - Students’ respect for classmates who excel in school
School Discipline Problems	- Arriving late at school - Absenteeism - Classroom disturbance - Cheating - Profanity - Vandalism - Theft - Intimidation or verbal abuse among students - Physical injury to other students - Intimidation or verbal abuse of teachers or staff - Physical injury to teachers or staff
Location	1=Urban and Suburban, 0=Else

Download Excel Table

For the gender variable, males were coded as 1, and females were coded as 0. Thus, the estimate of gender effects indicates how male students perform better/worse on the TIMSS mathematics achievement test as compared to female students.

The control variables, all of the independent variables apart from gender, were selected based on both theoretical and empirical consideration. The control variables except science achievement and school location were the scales which combined a set of TIMSS context questionnaire items. TIMSS also provides cut scores for each scale which makes it easy to interpret the scale scores. For the student level, the following variables were selected as the control variables: science achievement, home educational resources, like learning mathematics, a student’s view on engaging teaching in mathematics lessons, confidence in mathematics, and value mathematics. For the school-level, emphasis on academic success, school disciplinary problems, and location were selected. For school disciplinary problems, the higher the score, the lower the number of problems.

3. Participants

In this study, gender differences were examined among low-, and high-achieving students as well as the total students in order to see if the gender gap varies according to mathematics achievement. Therefore, the sample was divided into two groups: low- and high-achieving students, using the first and third quartile scores of the five plausible values for mathematics achievement. More specifically, all of the five plausible values of the low-achieving students were equal or lower than the first quartile score of the plausible values, whereas all of the five plausible values of the high-achieving students were equal or higher than the third quartile score. Since all plausible values rather than a single score were used to classify the groups, the actual proportions of the low- and high-achieving students were much smaller than 25%. For example, the proportion of the low-achieving students in Chinese Taipei was 18.4% and the proportion of high-achieving students in Chinese Taipei was 13.4%; the number of students and schools in each country are shown in <Table III-3>.

Table III-3. Sample Size in Each Country

		Chinese Taipei		Hong Kong SAR		Japan		Korea		Singapore
		N	%	N	%	N	%	N	%	N	%
Student	Total	5,681	100.0	3,893	100.0	4,713	100.0	5,309	100.0	5,945	100.0
	Low-Ach	1,044	18.4	721	18.5	809	17.2	950	17.9	1,202	20.2
	High-Ach	761	13.4	429	11.0	696	14.8	653	12.3	602	10.1
School	Total	189	100.0	124	100.0	146	100.0	150	100.0	162	100.0
	Low-Ach	175	92.6	87	70.2	136	93.2	148	98.7	108	66.7
	High-Ach	155	82.0	82	66.1	136	93.2	133	88.7	85	52.5

Download Excel Table

4. Analysis Procedure

A two-level multilevel model was used to examine the gender differences in mathematics achievement of total, low-, and high-achieving students due to the hierarchical structure of the TIMSS 2015.

First, the unconditional model (i.e., a model with no predictors) was produced to provide the information of variance component for mathematics achievement. The student-level model is given by

Y i j = β 0 j + e i j,

where Y_ij is the mathematics achievement of student i in school j, β_0j is the school mean achievement score of school j, and e_ij is the unique error of student i in school j.

The school-level model describes the school mean achievement score which is randomly around a grand mean and is explained as

β 0 j = γ 00 + u 0 j,

where γ₀₀ is the grand mean achievement score for all schools, and u_0j is the unique effect of school j.

The conditional model was used to examine gender effects after controlling all of the independent variables which are associated with the mathematics achievement. The student-level model is given by

Y i j = β 0 j + ∑ p = 1 P X p i j β p j + e i j,

where β_0j is the school mean achievement score for female students after all the independent variables are controlled; the female students were coded as 0 and the male students as 1. X_pij is the value of the student-level control variable for student i in school j.

Similarly, the school-level model includes school-level control variables for the grand mean, which is described as

β 0 j = γ 00 + ∑ q = 1 Q Z q j γ 0 q + u 0 j, β p j = γ p 0,

where γ₀₀ is the grand mean achievement score for female students after all of the independent variables are controlled, and Z_qj is the value of the school-level control variable for school j. The second equation of the student-level model implies that all of the explanatory variables are inserted in the intercept coefficient, since this study focuses on the gender gap when possible influencing factors are controlled.

This study used SPSS 21.0 to analyze descriptive statistics and compute the mean achievement scores. Specifically, the SPSS macro program created by the IEA IDB Analyzer (IEA, 2016) was required to compute the mean achievement scores due to the five plausible values for mathematics achievement. For the two-level multilevel modeling, HLM 6 (Randenbush, Byrk, Cheong, & Congdon, 2004) was used due to the hierarchical structure of TIMSS 2015. Also, the plausible values can be easily analyzed with this software.

IV. Results

1. Descriptive Statistics

<Table IV-1>, <Table IV-2>, and <Table IV-3> provide the means and standard deviations of the measures used in this study for total, low-, and high-achieving students, respectively. As can be seen in these tables, there is a tendency for most control variables to be positively associated with the level of mathematical ability. For example, confidence in mathematics for high-achieving students was the highest, followed by total students and then by low-achieving students.

Table IV-I. Descriptive Statistics for Total Students

Variable	Chinese Taipei		Hong Kong SAR		Japan		Korea		Singapore
Variable	M	SD	M	SD	M	SD	M	SD	M	SD
Math Ach1	601.23	96.56	589.33	78.04	585.14	88.38	603.19	85.04	614.92	83.07
Math Ach2	601.81	96.40	589.50	78.65	585.92	88.80	605.07	84.91	616.45	83.26
Math Ach3	601.99	97.24	591.30	78.97	586.13	88.62	604.48	85.13	616.75	83.11
Math Ach4	602.37	98.18	589.94	80.80	585.28	88.99	603.63	85.20	615.73	83.22
Math Ach5	601.81	96.15	590.35	79.58	585.63	89.83	603.90	84.95	617.32	83.55
Gender	0.51	0.50	0.53	0.50	0.49	0.50	0.51	0.50	0.51	0.50
Science Ach	571.93	82.60	542.34	72.78	569.96	75.25	553.84	77.10	591.47	87.98
Home Edu Res	10.37	1.65	10.09	1.63	10.97	1.49	11.58	1.60	10.29	1.59
Like Math	9.25	1.85	9.49	1.94	9.20	1.66	9.12	1.69	10.1	1.88
Eng Teaching	9.25	1.77	9.34	2.01	8.50	1.43	8.43	1.52	9.76	1.78
Conf in Math	9.12	2.39	9.38	2.20	8.96	1.95	9.43	1.83	9.72	2.17
Value Math	8.16	1.74	8.68	1.96	8.47	1.52	8.59	1.62	9.67	1.73
Emp on Acad	9.91	1.95	9.54	1.83	9.78	1.56	11.10	1.86	10.61	1.71
Discipline Prob	11.13	1.69	11.32	1.48	10.47	1.78	11.02	2.02	11.57	1.55
Location	0.54	0.50	0.85	0.35	0.38	0.49	0.55	0.50	1.00	0.00

Download Excel Table

Table IV-2. Descriptive Statistics for Low-Achieving Students

Variable	Chinese Taipei		Hong Kong SAR		Japan		Korea		Singapore
Variable	M	SD	M	SD	M	SD	M	SD	M	SD
Math Ach1	449.74	52.97	469.59	51.84	454.44	47.30	476.15	47.32	488.89	51.54
Math Ach2	450.48	53.05	468.46	52.47	454.24	47.99	478.29	46.13	489.91	51.33
Math Ach3	449.51	55.02	469.92	52.14	453.88	45.83	477.18	47.77	490.97	52.68
Math Ach4	448.62	55.76	464.87	56.47	454.15	48.22	476.76	47.84	489.32	51.48
Math Ach5	451.53	53.83	467.92	55.98	452.35	47.19	476.36	46.68	490.07	51.70
Gender	0.52	0.50	0.54	0.50	0.52	0.50	0.54	0.50	0.58	0.49
Science Ach	453.71	59.86	448.57	65.80	469.97	56.69	457.13	56.53	466.96	65.97
Home Edu Res	9.22	1.46	9.44	1.57	10.09	1.39	10.72	1.71	9.37	1.57
Like Math	8.12	1.67	8.57	1.92	8.28	1.55	8.14	1.59	9.31	1.85
Eng Teaching	8.58	1.90	8.90	2.34	8.05	1.53	7.89	1.64	9.50	1.92
Conf in Math	7.43	1.93	8.32	1.91	7.75	1.90	8.08	1.64	8.51	1.86
Value Math	7.29	1.93	8.04	2.30	7.86	1.72	7.75	1.68	9.41	1.88
Emp on Acad	9.79	1.84	9.14	1.74	9.65	1.50	11.08	1.86	10.12	1.53
Discipline Prob	11.13	1.70	11.01	1.45	10.46	1.78	10.99	2.02	11.22	1.46
Location	0.53	0.50	0.84	0.37	0.35	0.48	0.45	0.50	1.00	0.00

Download Excel Table

Table IV-3. Descriptive Statistics for High-Achieving Students

Variable	Chinese Taipei		Hong Kong SAR		Japan		Korea		Singapore
Variable	M	SD	M	SD	M	SD	M	SD	M	SD
Math Ach1	728.21	35.58	697.84	30.51	712.19	40.30	725.00	37.87	725.71	28.95
Math Ach2	727.16	36.25	699.47	30.98	712.11	40.61	728.10	39.12	726.32	28.11
Math Ach3	728.78	35.89	700.16	32.31	713.25	40.54	727.72	37.68	726.50	28.80
Math Ach4	728.23	37.16	699.12	31.44	713.43	40.16	725.12	39.02	725.66	28.27
Math Ach5	726.39	35.92	698.82	33.32	713.14	40.36	725.75	36.94	727.19	28.09
Gender	0.56	0.50	0.66	0.47	0.50	0.50	0.58	0.49	0.51	0.50
Science Ach	668.59	40.78	625.76	45.28	655.85	45.92	652.63	50.37	696.14	41.69
Home Edu Res	11.40	1.53	10.86	1.63	11.87	1.34	12.50	1.33	11.19	1.39
Like Math	10.55	1.63	10.78	1.69	10.11	1.64	10.39	1.56	11.19	1.68
Eng Teaching	9.93	1.68	9.64	1.81	8.82	1.39	9.05	1.52	10.21	1.68
Conf in Math	11.25	2.05	11.24	1.99	10.42	1.83	11.22	1.74	11.38	1.93
Value Math	9.16	1.59	9.54	1.72	8.91	1.36	9.72	1.66	10.01	1.64
Emp on Acad	10.10	1.96	10.01	1.83	9.86	1.56	11.24	1.77	11.30	1.66
Discipline Prob	11.17	1.65	11.55	1.46	10.52	1.78	10.98	2.05	12.01	1.52
Location	0.60	0.49	0.84	0.37	0.40	0.49	0.47	0.50	1.00	0.00

Download Excel Table

There was no consistency in the proportions of female and male students in each group. The proportions remained around 0.5 in the total sample, which was intended by the sampling procedure in TIMSS 2015. The low-achieving group has a larger number of male students in every country, but the difference in the proportions was negligible. For high-achieving students, there were more male students in Chinese Taipei, Hong Kong SAR, and Korea. Again, the difference in the proportions was small.

2. Gender Difference in Mathematics Achievement

Prior to the multilevel modeling analysis, the mean achievement scores of female and male students were simply compared using the t-test. As shown in <Table VI-4>, the gender differences for total students were insignificant except in Singapore. The mean score of the female students was almost 10 points higher than that of the male students in Singapore. For low-achieving students, a gender gap was present in Hong Kong SAR and Singapore. In both countries, the mean score of the female students was higher than that of the male students. For high-achieving students, a gender difference was detected in Hong Kong SAR. The mean score of the male students was 13 points higher than that of female students.

Table IV-4. Mean Comparison of Mathematics Achievement for the Female and the Male Students

Group	Country	Female		Male		Difference
Group	Country	M	SD	M	SD	Difference
Total	Chinese Taipei	599.07	2.57	599.14	3.00	−0.07
	Hong Kong SAR	591.47	4.70	596.77	5.99	−5.30
	Japan	587.56	3.15	585.34	3.04	2.22
	Korea	605.04	2.65	606.37	3.15	−1.33
	Singapore	625.75	3.41	616.44	3.80	9.31
Low	Chinese Taipei	452.18	2.91	446.01	3.58	6.17
	Hong Kong SAR	476.38	5.05	464.06	5.86	12.32
	Japan	453.42	3.20	454.34	2.92	−0.92
	Korea	479.62	2.66	475.38	2.99	4.24
	Singapore	496.65	4.08	488.01	3.59	8.64
High	Chinese Taipei	725.42	2.72	728.16	3.28	−2.74
	Hong Kong SAR	689.32	4.94	702.86	2.73	−13.54
	Japan	713.96	2.88	712.72	3.02	1.24
	Korea	724.18	3.84	728.30	2.64	−4.12
	Singapore	722.86	2.29	729.60	2.72	−6.74

^** p<.01,

^* p<.05

Download Excel Table

The variance components obtained from the unconditional model are summarized in <Table IV-5>. The intraclass correlation (ICC) was also computed to measure the magnitude of the variance between schools in the overall variance. The results revealed that between-school variance accounts for 8.8% to 52.3% of the total sample, 0.7% to 33.1% of the low-achieving group, and 3.0% to 10.5% of the high-achieving group. The between-school variance in Hong Kong SAR was much larger than that of the other countries. Next, a two-level multilevel model was applied to all of the samples, although a couple of subsamples showed rather small between-school variance (e.g., 0.7% of the low-achieving group of Korea). This is because the multilevel model still accounts for part of the between-school variance, even for these subsamples.

Table IV-5. Variance Components for Unconditional Model

Group	Level	Chinese Taipei	Hong Kong SAR	JAPAN	Korea	Singapore
Total	Student	7274.390	2944.823	6544.614	6622.890	3326.175
	School	1974.017	3769.983	1308.273	639.125	3652.369
	ICC	0.213	0.561	0.167	0.088	0.523
Low	Student	2743.286	2078.533	2151.853	2208.763	1691.206
	School	180.566	728.101	85.598	16.229	837.015
	ICC	0.062	0.259	0.038	0.007	0.331
High	Student	1268.374	907.638	1493.054	1401.303	715.122
	School	39.487	88.414	122.962	52.786	83.696
	ICC	0.030	0.089	0.076	0.036	0.105

^*** p<.001,

^** p<.01,

^* p<.05

Download Excel Table

In the following analysis, the control variables as well as the gender variables were introduced in order to examine the gender gap of total, low-, and high-achieving students after controlling all possible influencing factors on mathematics achievement. The result of the multilevel analysis for total, low-, and high-achieving students are presented in <Table IV-6>, <Table IV-7>, and <Table IV-8>, respectively.

Table IV-6. Estimates of Gender Difference in Mathematics Achievement of Total Students

Fixed Effects	Chinese Taipei		Hong Kong SAR		Japan		Korea		Singapore
Fixed Effects	b	SE	b	SE	b	SE	b	SE	b	SE
Intercept	601.029	1.746	594.545	5.954	582.545	1.421	604.768	1.546	621.854	1.857
Student-Level
Gender	−9.721	1.565	−3.252	1.920	−4.815	1.374	−6.760	1.389	−11.262	1.627
Science Achievement	0.852	0.027	0.554	0.033	0.786	0.025	0.740	0.021	0.681	0.023
Home Edu Resources	0.766	0.625	−0.845	0.647	4.156	0.639	2.313	0.525	−0.269	0.436
Like Math	4.046	0.850	4.159	0.781	1.612	0.742	2.859	0.740	4.441	0.509
Engaging Teaching	−1.926	0.548	−1.162	0.547	−2.857	0.755	−2.684	0.561	−0.950	0.509
Confidence in Math	8.310	0.712	6.499	0.472	9.955	0.778	11.208	0.620	5.565	0.410
Value Math	−0.086	0.563	0.131	0.522	1.012	0.638	1.483	0.625	−1.835	0.433
School-Level
Emp on Acad Success	2.353	0.543	5.558	1.699	4.768	1.056	2.679	0.504	2.954	0.787
Discipline Problem	0.557	0.645	4.086	2.059	0.466	0.816	−1.060	0.416	1.653	0.866
Location	8.616	2.007	−7.653	6.572	13.265	2.958	5.279	1.851	−	−
Random Effects
Between Students	1792.063		1485.928		2069.737		2042.137		1286.645
Between Schools	103.194		869.514		164.089		34.900		153.701
ICC	0.054		0.369		0.073		0.017		0.107
Variance Explained (%)
Between Students	75.36		49.54		68.37		69.17		61.32
Between Schools	94.77		76.94		87.46		94.54		95.79

^*** p<.001,

^** p<.01,

^* p<.05

Download Excel Table

Table IV-7. Estimates of Gender Difference in Mathematics Achievement of Low-Achieving Students

Fixed Effects	Chinese Taipei		Hong Kong SAR		Japan		Korea		Singapore
Fixed Effects	b	SE	b	SE	b	SE	b	SE	b	SE
Intercept	452.539	2.961	485.960	5.572	453.520	2.886	478.800	2.815	500.150	2.638
Student-Level
Gender	−8.908	2.863	−11.242	4.098	0.835	4.058	−3.572	3.133	−9.604	2.739
Science Achievement	0.597	0.051	0.473	0.039	0.449	0.060	0.464	0.048	0.488	0.030
Home Edu Resources	0.829	1.043	−0.015	1.273	0.946	1.473	1.583	0.838	0.188	0.916
Like Math	3.550	1.768	2.438	1.702	2.846	2.073	2.133	1.138	4.605	1.232
Engaging Teaching	−0.780	1.024	−0.163	1.428	−0.858	1.457	−1.499	1.105	−2.026	0.762
Confidence in Math	2.672	1.048	2.024	1.236	1.085	1.583	4.573	1.172	4.389	0.997
Value Math	−1.726	1.141	0.921	1.260	0.963	1.383	0.672	0.975	−0.943	0.762
School-Level
Emp on Acad Success	0.491	1.010	1.100	1.644	1.856	1.241	0.712	1.067	−0.078	1.372
Discipline Problem	1.824	0.965	2.159	1.908	−0.943	1.204	−0.979	0.930	1.513	1.513
Location	7.059	3.129	−8.100	6.337	1.250	3.377	0.852	3.029	−	−
Random Effects
Between Students	46.642		197.372		27.828		7.186		160.943
Between Schools	1569.871		1424.135		1482.986		1445.276		1094.066
ICC	0.029		0.122		0.018		0.005		0.128
Variance Explained (%)
Between Students	74.17		72.89		67.49		55.72		80.77
Between Schools	42.77		31.48		31.08		34.57		35.31

^*** p<.001,

^** p<.01,

^* p<.05

Download Excel Table

Table IV-8. Estimates of Gender Difference in Mathematics Achievement of High-Achieving Students

Fixed Effects	Chinese Taipei		Hong Kong SAR		Japan		Korea		Singapore
Fixed Effects	b	SE	b	SE	b	SE	b	SE	b	SE
Intercept	726.450	2.717	697.382	7.545	705.572	3.795	725.639	4.559	722.415	2.444
Student-Level
Gender	−2.709	4.167	1.831	3.724	−3.604	3.586	−1.126	3.452	2.008	2.895
Science Achievement	0.297	0.112	0.211	0.092	0.294	0.091	0.285	0.092	0.188	0.095
Home Edu Resources	0.372	0.882	0.079	1.838	3.785	1.448	0.246	1.653	−0.056	0.988
Like Math	0.874	1.548	0.848	1.595	−0.921	1.852	−0.255	2.140	1.327	1.092
Engaging Teaching	−2.163	1.110	−1.476	1.091	0.465	1.342	−0.177	1.697	0.304	1.053
Confidence in Math	3.277	1.324	2.850	1.264	5.193	1.773	4.190	1.191	1.895	1.079
Value Math	−0.153	1.009	−0.121	1.533	−0.311	1.651	−1.121	1.283	−0.574	0.872
School-Level
Emp on Acad Success	0.357	0.794	0.767	1.369	3.323	1.199	1.184	0.974	1.338	1.308
Discipline Problem	0.541	1.082	1.950	1.765	0.261	1.245	0.009	0.840	1.536	1.019
Location	2.282	3.536	−3.250	7.413	9.403	3.684	1.235	4.661	−	−
Random Effects
Between Students	39.283		63.732		52.965		13.441		34.328
Between Schools	1060.896		781.357		1215.105		1188.848		646.342
ICC	0.036		0.075		0.042		0.011		0.050
Variance Explained (%)
Between Students	0.52		27.92		56.93		74.54		58.99
Between Schools	16.36		13.91		18.62		15.16		9.62

^*** p<.001,

^** p<.01,

^* p<.05

Download Excel Table

For the total sample, the gender gap in mathematics achievement was significant except in Hong Kong SAR, even controlling for other variables (-9.721 in Chinese Taipei, -4.815 in Japan, -6.760 in Korea, and -11.252 in Singapore). In accordance with Kane and Mertz (2012), the mean achievement score of female students was higher than that of male students in those countries. For example, the gender gap was –9.721 in Chinese Taipei, which indicates that the mean achievement score of female students was 9.721 higher than that of male students. The surprising aspect of the results is that those gender differences were not identified when the t-test was used, except for Singapore.

For low-achieving students, a significant gender gap was detected in Chinese Taipei, Hong Kong SAR, and Singapore (-8.908 in Chinese Taipei, -11.242 in Hong Kong SAR, and -9.604 in Singapore). Similar to the total students’ results, the mean achievement score of female students was higher than that of male students. It is interesting that the gender difference in Hong Kong SAR was insignificant for all students but was significant for the low-achieving students. Again, the gender differences in Chinese Taipei were not detected in the t-test.

For high-achieving students, a gender gap was not present, which is consistent with Tsui (2007). It should be noted that this result conflicts with the consensus from the Western countries that more evident gap between boys and girls was detected at the right tail of mathematical ability (Benbow & Stanley, 1980, 1983; Ellison & Swanson, 2010; Fryer & Levitt, 2010; Robinson & Lubienski, 2011; Wai et al., 2010). Similar to the results from total and low-achieving students, the results were not matched to those from the t-test.

Interestingly, there was no gender effect either in low- or high-achieving students in Japan and Korea, even though the gender gap of the total students in both countries was significant. More importantly, a gender gap for low-achieving students was not present in these two countries, while the gender gap in other countries was almost 10 points. In accordance with previous research (Ho, 2009), these two countries showed similar characteristics in control variables. Especially, the students in Japan and Korea showed relatively high scores for home educational resources and relatively lower scores for engaging teaching than other countries. This finding implies that these two factors may be related to the small achievement gap between the female and male students in low-achieving group.

The impact of each control variable differed by sample. More specifically, science achievement had a significant impact on mathematics achievement for both total and low-achieving students. The effect of home educational resources was significant only for the total sample. The degree to which a student likes mathematics was significant for the total students in most of the countries, whereas it was insignificant for the low- and high-achieving students, apart from the low-achieving students in Singapore. The effect of confidence in mathematics was significant regardless of the sample in most countries. Emphasis on academic success affected mathematics achievement in all countries only for the total sample. The impacts of a school’s disciplinary problems and a school’s location were significant in some countries.

The impact of most variables was positive except for students’ views on engaging teaching in mathematics lessons. In other words, the more students highly evaluated engaged teaching in mathematics lessons, the lower their achievement. It is intriguing that this impact was not significant in Singapore, as the scale score for a student’s view on engaging teaching of the students in Singapore was higher than other countries.

V. Conclusion

The gender gap between female and male students in mathematics achievement has been studied over the past decades. The consensus derived from previous research is that gender differences are more apparent in high-achieving students. Cheema and Galluzzo (2013) pointed out, however, that this gender gap can be regarded as the “mixed effect” of actual gender differences and the impact of other important variables to mathematics achievement. The controversy over the gender gap in mathematics achievement implies two things: first, a gender gap may vary depending on levels of mathematical ability. Second, researchers need to consider the influencing variables on achievement. This study, therefore, investigated the gender differences of total, low-, and high-achieving students in high-achieving countries in TIMSS 2015, which includes Chinese Taipei, Hong Kong SAR, Japan, Korea, and Singapore. In consideration of the hierarchical structure of the data, a two-level multilevel model was implemented.

The main results of this study are summarized as follows. First, gender differences in mathematics achievement in East Asian countries were different depending on the level of mathematical ability. Specifically, for total students, this study has shown that gender differences in mathematics achievement were present in most East Asian countries except for Hong Kong SAR. However, for low-achieving students, of the four countries which showed a gender gap for total students, two countries, Chinese Taipei and Singapore, showed a gender gap. It is interesting that gender differences were not detected for total students but that a significant gender gap was detected for low-achieving students in Hong Kong SAR. For high-achieving students, this study showed that the gender gap was insignificant in all countries in East Asia, which conflicts with the consensus derived from Western countries that the gender difference is more apparent among high-achievers (Benbow & Stanley, 1980, 1983; Ellison & Swanson, 2010; Fryer & Levitt, 2010; Robinson & Lubienski, 2011; Wai et al., 2010).

Second, on the question of proper methodology for examining gender differences, this study found that there were clear differences between the results from the mean comparison and those from the two-level multilevel modeling. To be more precise, for total students, the result from the mean comparison found gender gap only in Singapore, whereas the result from the two-level multilevel model found gender gap in Chinese Taipei, Japan, Korea, and Singapore. For low-achieving students, the result from the mean comparison found gender gap in Hong Kong SAR and Singapore, whereas the result from the two-level multilevel model found gender gap in Chinese Taipei, Hong Kong SAR, and Singapore. For high-achieving students, the result from the mean comparison fount gender gap in Hong Kong SAR, whereas the result from the two-level multilevel model found no gender gap in East Asian countries. This finding confirms the idea set forth by Cheema and Galluzzo (2013) that the gender gap needs to be investigated alongside other important variables.

This study contributes additional evidence that gender differences in mathematics achievement should be examined with proper statistical methodology, which was suggested by Cheema and Galluzzo (2013). Also, this study found no gender gap in high-achieving students in East Asian countries, which was different from that in Western countries where more apparent gender difference in high-achieving students was found.

The limitations of this study and suggestions for future research are as follows. First, the limitation of this study lies in the fact that it investigated the gender gap in eighth grade students. Further research is recommended to investigate the gender differences among high school students, which can be directly related to college entrance examinations. Second, more various explanatory variables need to be considered, especially for low- and high-achieving students, considering that the effects of the control variables for those groups were not significant in this study. Third, this study might lead to the conclusion that there is little evidence of a gender gap for high-achievement students. However, according to a previous study, it was found that female students spend 19.5% more time studying mathematics than do male students (Guiso et al., 2008). In other words, not only the difference in scores between the male and the female students, but also learning strategies for how to efficiently study mathematics need to be examined to produce more in-depth explanations of the gender gap in mathematics achievement. Finally, further research should be done to investigate the possible reasons of gender differences in East Asian countries especially among low-achieving students, which may help assist and support these students.

References

Benbow, C. P., & Stanley, J. C. (1980). Sex differences in mathematical ability: Fact or artifact? Science, 210, 1262-1264. .

Benbow, C. P., & Stanley, J. C. (1983). Sex differences in mathematical ability: More facts. Science, 222, 1029-1031. .

Bronfenbrenner, U. (1979). The Ecology of Human Development. Cambridge, MA: Harvard University Press.

Cheema, J., & Galluzzo, G. (2013). Analyzing the gender gap in math achievement: Evidence from a large-scale US sample. Research in Education, 90(1), 98-112. .

Cimpian, J. R., Lubienski, S. T., Timmer, J. D., Makowski, M. B., & Miller, E. K. (2016). Have gender gaps in math closed? Achievement, teacher perceptions, and learning behaviors across two ECLS-K cohorts. AERA Open, 2(4), 1-19. .

Ellison, G., & Swanson, A. (2010). The gender gap in secondary school mathematics at high achievement levels: Evidence from the American Mathematics Competitions. The Journal of Economic Perspectives, 24(2), 109-128. .

Foy, P., & Yin, L. (2016). Scaling the TIMSS 2015 Achievement Data. In M. O. Martin, I. V. S. Mullis, & M. Hooper (Eds.). Methods and Procedures in TIMSS 2015 (pp. 13.1-13.62). Retrieved from Boston College, TIMSS & PIRLS International Study Center website: http://timssandpirls.bc.edu/publications/timss/2015-methods.html.

Friedman, L. (1989). Mathematics and the gender gap: A meta-analysis of recent studies on sex differences in mathematical tasks. Review of Educational research, 59(2), 185-213. .

Fryer, R. G., & Levitt, S. D. (2010). An empirical analysis of the gender gap in mathematics. American Economic Journal: Applied Economics, 2(2), 210-240. .

10.

Ganley, C. M., & Lubienski, S. T. (2016). Mathematics confidence, interest, and performance: Examining gender patterns and reciprocal relations. Learning and Individual Differences, 47, 182-193. .

11.

Glynn, S. M. (2012). International assessment: A Rasch model and teachers’ evaluation of TIMSS science achievement items. Journal of Research in Science Teaching, 49(10), 1321-1344. .

12.

Guiso, L., Monte, F., Sapienza, P., & Zingales, L. (2008). Culture, gender, and math. Science, 320, 1164-1165. .

13.

Ho, E. S. C. (2009). Characteristics of East Asian learners: What we learned from PISA. Educational Research Journal, 24(2), 327-348..

14.

Hojo, M., & Oshio, T. (2012). What factors determine student performance in East Asia? New Evidence from the 2007 Trends in International Mathematics and Science Study. Asian Economic Journal, 26(4), 333-357. .

15.

Hyde, J. S., Lindberg, S. M., Linn, M. C., Ellis, A. B., & Williams, C. C. (2008). Gender similarities characterize math performance. Science, 321, 494-495. .

16.

International Association for the Evaluation of Educational Achievement (IEA) (2016). IDB Analyzer (Version 3.2) [Computer software]. Hamburg, Germany: IEA..

17.

Jerrim, J. (2015). Why do East Asian children perform so well in PISA? An investigation of Western-born children of East Asian descent. Oxford Review of Education, 41(3), 310-333. .

18.

Jöreskog, K. G., & Goldberger, A. S. (1975). Estimation of a model with multiple indicators and multiple causes of a single latent variable. Journal of the American Statistical Association, 70(351a), 631-639. .

19.

Kane, J. M., & Mertz, J. E. (2012). Debunking myths about gender and mathematics performance. Notices of the American Mathematical Society, 59(1), 10-21. .

20.

Leung, F. K. S. (2010). Behind the high achievement of East Asian students. Educational Research and Evaluation, 8(1), 81-108..

21.

Meisenberg, G. (2016). Gender differences in school achievement across cultures: An analysis of results from PISA 2000-2012. Mankind Quarterly, 57(2), 227-251..

22.

Mislevy, R. J. (1991). Randomization-based inference about latent variables from complex samples. Psychometrika, 56(2), 177-196. .

23.

Mitra, A. (2002). Mathematics skill and male-female wages. The Journal of Socio-Economics, 31(5), 443-456. .

24.

Mullis, I. V. S., Martin, M. O., Foy, P., & Hooper, M. (2016). TIMSS 2015 International Results in Mathematics. Retrieved from Boston College, TIMSS & PIRLS International Study Center website: http://timssandpirls.bc.edu/timss2015/international-results/.

25.

Murnane, R. J., Willett, J. B., & Levy, F. (1995). The growing importance of cognitive skills in wage determination. Review of Economics and Statistics, 77(2), 251-266. .

26.

Niederle, M., & Vesterlund, L. (2010). Explaining the gender gap in math test scores: the role of competition. Journal of Economic Perspectives, 24(2), 129-144. .

27.

Raudenbush, S. W., Byrk, A. S., Cheong, Y., & Congdon, R. (2004). HLM 6 for Windows [Computer Software]. Lincolnwood, IL: Scientific Software International. .

28.

Robinson, J. P., & Lubienski, S. T. (2011). The development of gender achievement gaps in mathematics and reading during elementary and middle school: Examining direct cognitive assessments and teacher ratings. American Educational Research Journal, 48(2), 268-302. .

29.

Shen, C. (2007). How American middle schools differ from schools of five Asian Counties: Based on cross-national data from TIMSS 1999. Educational Research and Evaluation, 11(2), 179-199. .

30.

Strayhorn, T. L. (2010). The role of schools, families, and psychological variables on math achievement of black high school students. The High School Journal, 93(4), 177-194. .

31.

Tsai, S. L., Smith, M. L., & Hauser, R. M. (2018). Gender Gaps in Student Academic Achievement and Inequality. In H. Park & G. Kao (Eds.), Research in the Sociology of Education (Research in the Sociology of Education, Volumn 20) (pp. 181-218). Bingley, England: Emerald Publishing Limited. .

32.

Tsui, M. (2007). Gender and mathematics achievement in China and the United States. Gender Issues, 24(3), 1-11. .

33.

Wai, J., Cacchio, M., Putallaz, M., & Makel, M. C. (2010). Sex differences in the right tail of cognitive abilities: A 30 year examination. Intelligence, 38(4), 412-423. .

34.

Wößmann, L. (2005). The effect heterogeneity of central examinations: Evidence from TIMSS, TIMSS Repeat and PISA. Education Economics, 13(2), 143-169. .