Abstract: It is generally recognized that there are gender-related differences in children’s toy preferences. However, the magnitude of these differences has not been firmly established. Furthermore, not all studies of gender-related toy preferences find significant gender differences. These inconsistent findings could result from using different toys or methods to measure toy preferences or from studying children of different ages. Our systematic review and meta-analysis combined 113 effect sizes from 75 studies to estimate the magnitude of gender-related differences in toy preferences. We also assessed the impact of using different toys or methods to assess these differences, as well as the effect of age on gender-related toy preferences. Boys preferred boy-related toys more than girls did, and girls preferred girl-related toys more than boys did. These differences were large (d ≥ 1.60). Girls also preferred toys that researchers classified as neutral more than boys did (d = 0.29). Preferences for gender-typical over gender-atypical toys were also large and significant (d ≥ 1.20), and girls and boys showed gender-related differences of similar magnitude. When only dolls and vehicles were considered, within-sex differences were even larger and of comparable size for boys and girls. Researchers sometimes misclassified toys, perhaps contributing to an apparent gender difference in preference for neutral toys. Forced choice methods produced larger gender-related differences than other methods, and gender-related differences increased with age.
Discussion
We found a broad consistency of results across the large body of research on children’s gender-related toy preferences: children showed large and reliable preferences for toys that were related to their own gender. Thus, according to our review, gender-related toy preferences may be considered a well-established finding. Our results, with 75 studies and a range of toy preference measurements, complement and extend a previous meta-analysis of 16 studies focused on free play (Todd et al., 2018).However, our meta-analyses also revealed some gaps that could prevent confident inferences about the drivers and consequences of children’s gender-related toy preferences. These gaps could form priority targets for future research. Our analyses also revealed some emergent patterns in the data, especially in how gender-related preferences for broad groups of toys differed in some respects from those for dolls and vehicles, how study results varied according to study method, and how gender-related differences in toy preferences related to child age.
Toy Selection and Gender Categorization
The way that toys are selected, and categorized, as boy-related or girl-related, is not standardized in the present research. Studies in our review appeared to treat the gender categorization of toys as uncontroversial, even though, according to our review, it was not uncommon for toys to be assigned to different gender categories in different studies. For example, in some studies, blocks were classified as boy-related toys (e.g., Alexander & Saenz, 2012; Benenson et al., 1997; Fagot & Patterson, 1969), and in other studies they were classified as neutral toys (e.g., Cherney et al., 2003; Guinn, 1984; Wood, Desmarais, & Gugula, 2002). Similarly, drawing toys were sometimes categorized as girl-related toys (e.g., Berenbaum & Hines, 1992; Martin et al., 2013), and sometimes as neutral toys (e.g., Berenbaum & Snyder, 1995; Pasterski et al., 2005); and stuffed toys were equally likely to be classified as girl-related toys (e.g., DeLucia, 1963; Jacklin et al., 1973) as neutral toys (e.g., Alexander & Saenz, 2012; Idle et al., 1993; Moller & Serbin, 1996), but were also sometimes classified as boy-related toys (e.g., Stagnitti, Rodger, & Clarke, 1997). This pattern suggests that researchers sometimes disagree on what toys are boy-related, girl-related, or neutral.In addition to finding that researchers sometimes disagreed on toy classifications, we also found that researchers typically did not report how they had selected toys for study or how they had assigned the toys to gender categories. We suspect that, in most cases, researchers used a simple heuristic method based on perceived cultural stereotypes. There are two problems with this type of approach. First, as noted above, toys categorized using this approach do not always fall into the same gender category in different studies. If one study includes a stuffed toy in the category “girls’ toys” and another study includes a stuffed toy in the category “neutral toys,” they may well report different results, even if the true underlying effect they are measuring is the same. Second, at its extreme, this problem may manifest as criterion contamination, in which gender-typed toys are defined by the results of the study. That is, the researchers may use many toys and select as “gender-related” toys the ones that they find to be differentially preferred by gender. At best, this tautology limits the generalizability of study results to other samples. At worst, it could invalidate the study.
Using methods that avoid confusion about toy categorization could be a priority for future research on children’s gender-related toy preferences. As also suggested by Fine (2015), this field could benefit from researchers specifying more clearly the ways in which they selected and categorized toys. Depending on the goal of the study, this selection and categorization might be based on different criteria. For example, a study examining whether stereotypes about children’s toy preferences relate to children’s actual preferences, might select toys based on adults’ independent ratings of the gender stereotyping of toys. In contrast, a study of the effect of a particular mechanism, such as social, cognitive, or hormonal influences, on toy preferences might select toys based on prior studies’ findings that certain toys are on average preferred by girls or boys. Overall, the important point is that researchers report more clearly how they selected toys and assigned toys to gender categories.
Researchers also have begun to investigate specific hypotheses about what characteristics of different toys might make them appeal more to boys or to girls. For instance, it has been suggested that color or shape might influence children’s gender-related preferences (e.g., Jadva et al., 2010; Weisgram et al., 2014; Wong & Hines, 2015). Similarly, it has been suggested that affordance of activity, motion, or propulsion might influence these preferences (Alexander & Hines, 2002; Benenson et al., 1997; Hassett et al., 2008; for a review, see Zosuls & Ruble, 2018). To evaluate these suggestions, it would be useful if researchers could provide color images, or full descriptions, of the toys used in the research they report. Similarly, it would be useful for this purpose, as well as for future reviews, if researchers could provide descriptive statistics, including means and SD or similar, by sex, for individual toys used, and not just for toy groupings.
To test whether the meta-analysis results were affected by researchers’ definitions of toy gender, we analyzed the subset of effect sizes that related to a very narrow definition of boy-related toys and girl-related toys: specifically, vehicles and dolls. These toys were the only ones for which sufficient data had been reported to allow reliable meta-analyses. The gender effects observed in the overall meta-analyses were broadly replicated with this more narrowly defined subset of toys, giving us confidence that our overall meta-analytic results were not entirely dependent on how researchers had chosen to categorize toys in regard to gender.
Furthermore, we found that girls’ gender-specific preference for dolls over vehicles was larger than their preference for broadly defined groups of girl-related toys. However, despite the large effect size, girls’ gender-specific preference for dolls over vehicles was not statistically significant, as this effect also showed large meta-analytic statistical variance. The large meta-analytic statistical variance is due to a combination of large variances in girls’ preference for dolls within the studies, variation between studies introducing additional statistical variance, and a smaller total number of studies that reported separate statistics for dolls as compared to broadly defined toy groups. In addition, the broadly defined toy groups included toys that, as mentioned above, were classified as neutral in some studies but girl-related in others. If toys are classified consistently, girls may show gender-related preferences at least as large as those of boys.
Culture and Gender-Related Toy Preferences
Cultural perceptions of play, including play with toys, may differ in different cultural, ethnic, or socioeconomic groups. For example, play is viewed as central to children’s cognitive and social development in many Western, technologically developed societies, but as less important in more traditional societies (Roopnarine, 2010). Children in different cultures may also have different referential concepts for appropriate gender-related behavior, due to cultural variation in gender norms (Pfeiffer & Butz, 2005; Wood & Eagly, 2002). This possibility is particularly relevant to toy preferences, because there may be cultural variations in the toys that are available, culturally relevant, and gender-related.Nevertheless, little empirical research is presently available on cultural variation in gender-related toy preferences. Our review revealed that most toy preference studies focus on the U.S., Canada, the UK, and Australia. Of those studies conducted outside English-speaking industrialized nations, one was conducted in France (Le Maner-Idrissi, 1996), one in Finland (Lamminmäki et al., 2012), four in Sweden (Nelson, 2005; Nordenström, Servin, Bohlin, Larsson, & Wedell, 2002; Serbin et al., 2001; Servin, Bohlin, & Berlin, 1999), and one in the Netherlands (van de Beek et al., 2009). An additional study included some participants from Hungary, along with participants from the UK (Turner & Gervai, 1995). These studies did not report different results to the studies from the English-speaking countries, even when researchers had specifically hypothesized that they would (e.g., Nelson, 2005). In global perspective, however, these countries are very similar in terms of industrialization, wealth, education, media access, democracy, and gender equality. Consequently, children in these countries probably have very similar toys available to them and similar access to information about dominant social stereotypes around these toys. It remains an open question, then, whether children in cultures with radically different stereotype referents and social norms would show the same gender-related toy preferences to those found in the current meta-analysis.
We did not formally investigate other aspects of cultural diversity, such as ethnicity and socioeconomic status, because these also have not received much attention in empirical studies of gender-related toy preferences. Participants in most toy preference studies are not very ethnically diverse, and so it may not be practical to report results by ethnicity. We found three studies (out of our total 75) that reported toy preferences by ethnicity. Two of these studies were conducted in the USA and reported no significant differences in gender-related toy preferences between children of Hispanic and non-Hispanic background (Goble, 2012), or Native American and non-Native American background (Guinn, 1984). In contrast, another study based in the U.S. found that ethnicity might affect children’s preferences for gender-related activities, including play with toys, via children’s social networks (Martin et al., 2013). Furthermore, in recent years, the wider field of gender development research has paid increasing attention to the intersectionality of gender, ethnicity, and other identities (e.g., Shields, 2008). This trend in the wider field may translate in future to more studies investigating gender-related toy preferences in diverse social groups.
Methods of Measuring Toy Preference Are Important
Studies may find different gender effects on children’s toy preferences, depending on the method they use to measure toy preferences. We evaluated four categories of study methods: free play methods, where children were given access to a set of toys and observed playing, however, they liked; visual preference measures, where children were asked to look at pictures of toys; forced choice methods, where children were asked to choose toys or pictures of toys, typically in front of an experimenter; and naturalistic methods, where children’s toy options were not predefined by the researchers or other adults. We found that forced choice methods consistently showed larger gender differences than other methods.There are two possible explanations for this pattern. One is the potential demand characteristics of forced choice paradigms. A request to publicly choose an option may be interpreted as evaluative by children, who then feel obliged to give the answer that they feel is “correct,” rather than indicate their actual preference. Children’s propensity to misunderstand requests for information as tests has been noted in other contexts (e.g., Lamb et al., 2003). Another possibility is that the paradigm creates a false dichotomy. In forced choice methods, the child is usually presented with one boy-related option and one girl-related option and asked to choose between them. There is usually not a neutral option, and, generally, the child must choose only one option and reject the other. In contrast, in a free play paradigm, children typically have more response options available, such as several toys associated with each gender, or neutral toys as well as gender-related toys. Even if only two toys are available, the child has more options than in most forced choice paradigms. For example, if a doll and a car are available, a child may choose to play with the doll, play with the car, play with both the doll and the car, or play with neither. In most forced choice methods, however, children must choose one and only one of two options.
Forced choice methods, in their current form, do not give comparable results to other methods of measuring gender-related toy preferences. Nevertheless, forced choice methods can be an efficient and easily administered measurement tool and therefore may be appropriate for studies where, for example, data need to be collected across a very large group or under difficult conditions. Future investigators wishing to measure gender-related toy preferences with an easily administered tool might do so, however, with the aim of minimizing artificial inflation in effect sizes. For instance, a procedure in which the experimenter cannot see which option the child selects, and the child knows that their response is not seen, might be useful. It also might be useful to include neutral options, as well as gender-related options, and allow the range of possible choices to include “both” or “neither.” These modifications of forced choice methods could provide results that are more comparable to other methods of measuring toy preference and perhaps are more reflective of children’s actual gender-related preferences.
Child Age and Gender-Related Toy Preferences
We found that gender differences in preferences for gender-related toys increased linearly with child age. Our results further suggested that this pattern could be explained by boys’ showing increased preference for gender-typical over gender-atypical toys with age, while girls’ preferences for gender-typical over gender-atypical toys did not increase significantly with age. Similarly, the previous meta-analysis of free play studies (Todd et al., 2018) found an increase in gender-related play with age in boys, although they did not find increasing gender differences. This may reflect a difference in the power of the two meta-analyses; the previous meta-analysis included 16 studies, whereas the current meta-analysis included 75 studies. We did not find significant curvilinear effects of age on children’s gender-related toy preferences.Our findings of linear effects contrast with those of some prior investigations of age effects on children’s gender-related toy preferences. For example, Campbell et al. (2000) measured infants’ gender-related visual preferences longitudinally at ages 3, 9, and 18 months. They found that preferences did not change with age, but the infants were all very young compared to the age range in the wider literature and in the current meta-analysis.
In contrast, our meta-analytic findings suggest that boys’ and girls’ gender-related toy preferences increase with age in a linear fashion. These findings resemble findings for a broader measure of children’s gender-typical behavior, the Pre-School Activities Inventory (PSAI). The PSAI is a 24-item parent report inventory that asks about children’s gender-typed toy preferences and about children’s gender-related activity and playmate preferences. A longitudinal, population study in which the PSAI was completed by a parent to describe their child at ages 2, 3, and 5 years also found that both boys and girls became increasingly gender-typed with age (Golombok et al., 2008).
Our results suggest that children’s toy preferences might become more gender-related with age, as predicted by several theories of gender development. Children might be encouraged, through socialization pressures such as modeling and reinforcement, to prefer same gender-related toys, and the effects of this socialization may accumulate as they get older (Fagot, Rodgers, & Leinbach, 2000). Additionally, based on their early gender-related toy interests, children might gravitate to different social environments, enhancing their early preferences (Liben & Bigler, 2002; Martin et al., 2013). Finally, differences in children’s prenatal and early postnatal hormone exposure may dynamically interact with social environments and cognitive processes to increase children’s gender-related preferences over time (Hines, 2012). Together, these social and cognitive effects, and their interactions with early hormonal influences, may explain the linear increase in gender-related differences with age.
The findings of our meta-analysis, however, are not a substitute for a large, longitudinal study of children’s gender-related toy preferences. We used meta-analytic techniques to compare gender-related preferences in children from different age groups, reported in different studies. Our analysis, therefore, was cross-sectional and does not have the inferential power of a well-controlled longitudinal study. Our results would be best confirmed by a future longitudinal study of children’s gender-related toy preferences from infancy to pre-pubertal age. The longitudinal parent report study using the PSAI (Golombok et al., 2008) is the closest existing example and found similar results to our meta-analysis.
Gender-Related Toy Preferences Over Time
We found no change in the magnitude of gender-related differences in toy preferences across year of publication. The results of the moderator analyses suggested that gender effects on children’s toy preferences have remained generally constant in magnitude across the past five decades. This finding might seem surprising. Since the earliest studies on gender-related toy preferences, gender-atypical behavior and preferences have become increasingly socially acceptable. Perhaps the lack of any discernible pattern of change results from different social pressures influencing gender-related toy preferences in different directions. For example, growing acceptance of gender-atypical behavior may be countered by increasing gender segregation of the toy market.Contrary to our results, a previous meta-analysis of children’s toy preferences (Todd et al., 2018) found that boys and girls played more with gender-related toys in earlier studies than in more recent studies. Todd et al. suggested that increasing gender equality in Western societies could influence children to play with neutral toys, due to increased advertising to children about gender-neutral toys. A recent analysis of online toy marketing, however, found that more toys were marketed for “boys only” or for “girls only” than for both (Auster & Mansbach, 2012), and an analysis of department store catalogs concluded that gender differentiation in toy advertising had increased since the 1980s as marketers employed gender stereotypes to encourage sales (Sweet, 2013). Taken together, these analyses challenge the view that gender-related toy advertising is decreasing with time. Alternatively, the previous finding could be partly explained by the smaller time frame considered in the prior meta-analytic review; the prior review covered about 35 years of research, while the present review covered 50 years.
It may be that children’s preferences are robust to social influences at this macrolevel; or that, despite social change, the underlying cultural environment regarding gendered toys has not changed. A similar result was found in a systematic review of gender stereotypes from the 1970s to the present. Rudman and Glick (2008) hypothesized that women’s changing social roles would be reflected in changing stereotypes of women. Although they found a change in women’s self-concept over time, they also found that more general stereotypes of women’s personalities had not changed. They suggested that the lack of change might be due to people viewing personality as part of the fundamental essence of gender, and therefore being reluctant to modify their stereotypic beliefs about personality. A similar explanation may also apply to toy preferences: if people view toy preferences as an essential part of a child’s gender, they may be unlikely to change their gender-related beliefs about toy preferences. Children may then adapt their actual toy preferences to reflect broader societal beliefs.
Limitations
The meta-analysis could only include data that were reported in the individual toy preference studies. Therefore, we could not analyze variables such as color or shape, or individual toys other than dolls and vehicles. In future research, if investigators report more information about toy characteristics and about individual toys, it may be possible to discover more about what characteristics of different toys make them more likely to be preferred by one gender or another.Our literature search covers papers published to March 2014 and does not include papers published outside of this time frame. More recent papers may therefore be missing from the current meta-analysis. The current meta-analysis, however, synthesizes 50 years of research on toy preferences and finds that toy preference effect sizes have not changed significantly over time. Thus, results from a new review including more recent papers would be unlikely to differ from what we report.
We focused on gender-related preferences in typically developing children. Some studies selected participants specifically because they were not typically developing (for example, clinical samples of children with genetic variants causing atypical early hormone environments, or children who showed gender-related behavior that was noticeably different from their peers). To include these atypical populations in our study might have skewed the results, so we did not include them. Our results, therefore, may not apply to clinical populations.
Additionally, we meta-analyzed only direct measures of children’s toy preferences. We did not, for example, include parent report measures. Similarly, we did not include broader aspects of children’s gender-related behavior, such as activity preferences, playmate preferences, or sex role identification (e.g., Brown, 1956). Additionally, we did not search for these broader terms, so we may have missed papers that included toy preferences in a broader measure of sex role identification or androgyny (e.g., Zucker & Torkos, 1989). It would be interesting to know whether meta-analyses from these other sources of data and types of gender-related behavior would show similar outcomes. We hope that the current systematic review and meta-analysis will encourage such studies.