Friday, March 10, 2023

The COVID-19 epidemic is accompanied by substantially and significantly lower intelligence test scores

Breit M, Scherrer V, Blickle J, Preckel F (2023) Students’ intelligence test results after six and sixteen months of irregular schooling due to the COVID-19 pandemic. PLoS ONE 18(3): e0281779, Mar 8 2023. https://doi.org/10.1371/journal.pone.0281779

Abstract: The COVID-19 pandemic has affected schooling worldwide. In many places, schools closed for weeks or months, only part of the student body could be educated at any one time, or students were taught online. Previous research discloses the relevance of schooling for the development of cognitive abilities. We therefore compared the intelligence test performance of 424 German secondary school students in Grades 7 to 9 (42% female) tested after the first six months of the COVID-19 pandemic (i.e., 2020 sample) to the results of two highly comparable student samples tested in 2002 (n = 1506) and 2012 (n = 197). The results revealed substantially and significantly lower intelligence test scores in the 2020 sample than in both the 2002 and 2012 samples. We retested the 2020 sample after another full school year of COVID-19-affected schooling in 2021. We found mean-level changes of typical magnitude, with no signs of catching up to previous cohorts or further declines in cognitive performance. Perceived stress during the pandemic did not affect changes in intelligence test results between the two measurements.

Discussion

Intelligence test results were lower in the pandemic 2020 sample than in the prepandemic 2002 and 2012 samples. The differences in test scores were large, with a difference in general intelligence of 7.62 IQ points between 2020 and 2002 (Analysis 1a). This difference did not appear to be a continuation of a longer decreasing trend. In contrast, we observed larger test scores in 2012 than in 2002 but lower scores in 2020. The difference between 2012 and 2020 was also substantial, with a difference in general intelligence of 6.54 points (Analysis 1b). The cross-sectional cohort comparisons therefore seem to corroborate previous results that regular schooling has a substantial impact on intelligence development and its absence is detrimental for intelligence test performance [9]. The difference in test scores was remarkably large. It may be the case that the student population was hit particularly hard by the pandemic, having to deal with both the disruption of regular schooling and other side effects of the pandemic, such as stress, anxiety, and social isolation [68]. Moreover, students are usually very accustomed to testing situations, which may be less the case after months of remote schooling.

Creativity scores were notably lower than other scores in 2002. It therefore seems like the nonsignificant difference in creativity between 2002 and 2020 was not due to creativity being unaffected by the pandemic, but instead due to creativity scores being low in 2002. This is supported by significantly higher creativity scores in 2012. Lower creativity in 2002 than in later years may be due to unfamiliarity with the testing format, changes in curricula, or changes in out of school activities.

Importantly, the overall results are inconsistent with one possible alternative explanation of decreasing intelligence test scores, namely, a reverse Flynn effect. Flynn observed a systematic increase in intelligence scores across generations in the 20th century [69]. In some countries, a reversed Flynn effect with decreasing intelligence scores across generations has been observed in recent years [177071]. This seems to be an especially plausible alternative explanation for the observed differences in test scores in our Analysis 1a. However, there are arguments against this alternative explanation. A reversal of the Flynn effect has not yet been observed in Germany. Instead, even in recent years, a regular positive Flynn effect has been reported [4572]. Moreover, a reverse Flynn effect is also inconsistent with our observation of increasing test scores from 2002 to 2012. We observed an increase in General Intelligence equivalent to .47 IQ points per year, which is slightly larger than the typically observed Flynn effect [73] or the Flynn effect observed in Germany [45]. The observed decrease in test scores from 2012 to 2020 with .82 IQ points per year for General Intelligence is also much larger than the reverse Flynn effect observed elsewhere (.32 IQ points) [74], making it unlikely that this effect alone could account for the observed decline.

The longitudinal results (Fig 9) showed an increase in test scores between the test (2020) and retest (2021). The magnitude of the increase is in line with the retest effects for intelligence testing that have been quantified meta-analytically (d = .33) [46]. In some cases the retest effects were larger than expected based on the meta-analysis (e.g., Processing Speed, Figural Ability). However, these cases were largely in line with a previous investigation of retest effects in a subsample of the BIS-HB standardization sample, [75] with no clear pattern of consistently larger or smaller retest effects in the present sample. These results indicate neither a remarkable decrease nor a “catching up” to previous cohorts.

Interestingly, we found no impact of perceived stress on the change in intelligence test scores. A possible explanation for the observed results is that stress levels were especially high in the first months of the pandemic, when there was the greatest uncertainty about the nature of the disease and lockdowns and school closures were novel experiences. Some evidence for a spike in stress levels at the beginning of the pandemic comes from tracking stress-related migraine attacks [76] and from a longitudinal survey of college students that was conducted in April and June 2020, finding the highest stress levels in April [77]. Moreover, teachers and students were both completely unprepared for school closures and online teaching at the beginning of the pandemic. The retest was conducted after a month-long period of regular schooling, followed by a now more predictable and better prepared switch to remote schooling that did not catch teachers and students off guard entirely. These factors may explain why intelligence performance did not drop further and why stress levels did not have an effect on the change in performance in the second test.

Strengths and limitations

The present study has several strengths. To our knowledge, this is the first investigation of the development of intelligence test performance during the pandemic. Moreover, we used a relatively large, heterogeneous sample and a comprehensive, multidimensional intelligence test. We were able to compare the results of our sample with two highly similar prepandemic samples using propensity score matching. Last, we retested a large portion of the sample to longitudinally investigate the development of intelligence during the pandemic.

However, the present study also has several limitations that restrict the interpretation of the results. First, due to the pandemic affecting all students, we were not able to use a control group but had to rely on samples collected in previous years. Cohort effects cannot be completely excluded, although we tried to minimize their influence through propensity score matching and the use of two different prepandemic comparison groups. We could not control for potential differences in socioeconomic status (SES) between the samples because no equivalent measure was used in all three cohorts. It would have been beneficial to control for SES because of its influence on cognitive development and on the bidirectional relationship of intelligence and academic achievement [9]. SES differences between samples therefore may account for some of the observed test score differences. However, large differences in SES between the samples are unlikely because the 2012 and 2020 samples were drawn from the same four schools. Regarding the impact of SES on the longitudinal change during the pandemic in the 2020 sample, we did not have a comprehensive SES measure available. However, we had information on the highest level of education of parents. When adding this variable as a predictor in the LCA analyses, the results did not change, and parents’ education was no significant predictor of change.

Second, both measurement points of the study fell within the pandemic. A prepandemic measurement is not available for our 2020 sample. This limits the interpretation of the change in test scores over the course of the pandemic, even though we compared the observed retest effects with those found in meta-analysis and a previous retest-study of the BIS-HB.

Third, the 2020 measurement occurred only a few weeks after the summer break. It has often been shown that the summer break causes a decrease in math achievement test scores [78] as well as intelligence test scores [79]. However, this “summer slide” effect on intelligence seems to be very modest in size [80] and is therefore unlikely to be fully responsible for the large observed cohort differences in the present investigation.

Fourth, perceived stress was only measured by a short, retrospective scale. The resulting scores may not very accurately represent the actual stress levels of the students over the school year. Moreover, perceived stress was not measured at the first measurement point, so changes in stress levels during the pandemic could not be examined. This limits the interpretation of the absence of stress effects on changes in intelligence.

Fifth, the matched groups in Analysis 1b were somewhat unbalanced with regard to grade level (Table 1). The students in the 2020 sample tended to be in higher grades while being the same age. However, this pattern is unlikely to explain the differences in intelligence. The students in the 2020 sample tended to have experienced more schooling at the same age than the other samples, which would be expected to be beneficial for intelligence development [1011].

Sixth, there was some attrition between the first and second measurement of the 2020 sample. This was due to students changing schools or school classes, being sick or otherwise absent on the second day of testing or failing to provide parental consent for the second testing. It may be plausible that especially students with negative motivational or intellectual development changed school or avoided the second testing. This means that the improvement between the first and second time of measurement may be somewhat overestimated in the present analyses.

Seventh and last, only a modest percentage of the samples were matched in the PSM procedure because we followed a conservative recommendation for the caliper size [55] that yielded a very balanced matching solution. The limited common support somewhat diminishes the generalizability of the findings to the full samples.

Implications

The pandemic and the associated countermeasures affected the academic development of an entire generation of students around the world, as evidenced by decreases in academic achievement [3]. Simulations predict a total learning loss between .3 and 1.1 school years, a loss valued at approximately $10 trillion [81]. Although we cannot make any causal claims with the present study, our results suggest that these problems might extend to students’ intelligence development. They point out that possible detrimental effects especially took place during the first months of the pandemic. Moreover, our longitudinal results do not point to any recovery effects.

As schooling has a positive impact on students’ cognitive development, educational institutions worldwide have a chance to compensate for such negative effects in the long term. As interventions aimed at the improvement of academic achievement also affect intelligence, [9] the decline in intelligence could be recovered if targeted efforts are made to compensate for the deficit in academic achievement that has occurred. Furthermore, schools could pay attention to offering intellectually challenging lessons or supplementary programs in the afternoons or during vacations, as intellectually more stimulating environments have a positive effect on intelligence development [82].

A second implication concerns current intelligence testing practice. If there is a general, substantial decrease in intelligence test performance, testing with prepandemic norms will lead to an underestimation of the percentile rank (and thus IQ) of the person being tested. This can have significant consequences. For example, some giftedness programs use IQ cutoffs to determine eligibility. Fewer students tested during (or after) the pandemic may meet such a criterion. If the lower test performance persists even after the pandemic, it may even be necessary to update intelligence test norms to account for this effect.

As discussed in the previous section, the present study has several limitations. The results can therefore only be regarded as a first indication that the pandemic is affecting intelligence test performance. There is a need for further research on this topic to corroborate the findings. It is obviously no longer possible to start a longitudinal project with prepandemic measurement points. However, the present article presented a way to investigate the effect of the pandemic if prepandemic comparison samples are available. Ideally, the prepandemic samples would have been assessed shortly before the pandemic onset to minimize differences between cohorts due to the (reverse) Flynn effect, changes in school curricula, or school policy changes. If a sample was assessed very recently before the pandemic, it may also be possible to retest the participants for the investigation of the pandemic effects. Although we cannot make any causal claims with the present study, our results suggest that COVID-19-related problems might extend to students’ cognitive abilities. As intelligence plays a central role in many areas of life, it would be important to further investigate differences between prepandemic and current student samples to account for these differences in test norms and for possible disadvantages by offering specific interventions.


Contrary to the cliché widespread among intellectuals of ordinary people as easily deceived simpletons, humans have an evolutionary rooted distrust of what others say., of all things, this "epistemic vigilance" may be the foundation for delusions

Delusions as Epistemic Hypervigilance. Ryan McKay, Hugo Mercier. Current Directions in Psychological Science, March 8, 2023. https://doi.org/10.1177/09637214221128320

Abstract: Delusions are distressing and disabling symptoms of various clinical disorders. Delusions are associated with an aberrant and apparently contradictory treatment of evidence, characterized by both excessive credulity (adopting unusual beliefs on minimal evidence) and excessive rigidity (holding steadfast to these beliefs in the face of strong counterevidence). Here we attempt to make sense of this contradiction by considering the literature on epistemic vigilance. Although there is little evolutionary advantage to scrutinizing the evidence our senses provide, it pays to be vigilant toward ostensive evidence—information communicated by others. This asymmetry is generally adaptive, but in deluded individuals the scales tip too far in the direction of the sensory and perceptual, producing an apparently paradoxical combination of credulity (with respect to one’s own perception) and skepticism (with respect to the testimony of others).

Epistemic Vigilance

A set of putative cognitive mechanisms serves a function of epistemic vigilance: to evaluate communicated information so as to accept reliable information and reject unreliable information (Sperber et al., 2010). The existence of these mechanisms has been postulated on the basis of the theory of the evolution of communication (e.g., Maynard Smith & Harper, 2003Scott-Phillips, 2008). For communication between any organisms to be stable, it must benefit both those who send the signals (who would otherwise refrain from sending them) and those who receive them (who would otherwise evolve to ignore them). However, senders often have incentives to send signals that benefit themselves but not the receivers. As a result, for communication to remain stable, there must exist some mechanism that keeps signals, on average, reliable. In some species, the signals are produced in such a way that it is simply impossible to send unreliable signals—for instance, if the signal can be produced only by large or fit individuals (see, e.g., Maynard Smith & Harper, 2003). In humans, however, essentially no communication has this property.1 It has been suggested instead that humans keep communication mostly reliable thanks to cognitive mechanisms that evaluate communicated information, rejecting unreliable signals and lowering our trust in their senders—mechanisms of epistemic vigilance.
To evaluate communicated information, mechanisms of epistemic vigilance process cues related to the content of the information (Is it plausible? Is it supported by good arguments?) and to its source (Are they honest? Are they competent?). A wealth of evidence shows that humans possess such well-functioning mechanisms (for review, see, e.g., Mercier, 2020), that they are early developing (being already present in infants or toddlers; see, e.g., Harris & Lane, 2014), and that they are plausibly universal among typically developing individuals. Crucially for the point at hand, these epistemic vigilance mechanisms are specific to communicated information. Our own perceptual mechanisms evolved to best serve our interests, and there are thus no grounds for subjecting their deliverances to the scrutiny that must be deployed for other individuals.
There is now a large amount of evidence that people systematically discount information communicated by others. This tendency has often been referred to as egocentric discounting (Yaniv & Kleinberger, 2000), and it has been observed in a wide variety of experimental settings (for a review, see Morin et al., 2021). For instance, in advice-taking experiments, participants are asked a factual question (e.g., What is the length of the Nile?), provided with someone else’s opinion, and given the opportunity to take this opinion into account in forming a final estimate. Overall, participants put approximately twice as much weight on their initial opinion as on the other participant’s opinion, even when they have no reason to believe the other participant less competent than themselves (Yaniv & Kleinberger, 2000).
The discounting of others’ opinions can be overcome if we have positive reasons to trust them or if they present good arguments—in particular, if our prior opinions are weak (see, e.g., Mercier & Sperber, 2017). However, in the absence of such positive reasons, discounting is a pervasive phenomenon. There is no such systematic equivalent when it comes to perception. Although in some cases we can or should learn to doubt what we perceive (e.g., when attending to the reminder that “objects in mirror are closer than they appear” while driving), this is typically an effortful process with uncertain outcomes. In visual perception, for example, models in which the observer behaves like an optimal Bayesian learner have proven very successful at explaining participants’ behavior (e.g., Geisler, 2011). Even if there are deviations from this optimal behavior (e.g., Stengård & van den Berg, 2019), they do not take the form of a systematic tendency to favor our priors over novel information.
There is thus converging evidence (a) that humans process communicated information differently than information they acquire entirely by their own means and (b) that the former is systematically discounted by default (i.e., in the absence of reasons to behave otherwise, such as reasons to believe the source particularly trustworthy or competent). This, however, leaves open significant questions of great relevance for the present argument. In particular, to what stimuli does epistemic vigilance apply to? Presumably, epistemic vigilance evolved chiefly to process the main form of human communication: ostensive communication, which includes verbal communication but also many nonverbal signals (from pointing to frowning). Related mechanisms apply to other types of communication, such as emotional communication (Dezecache et al., 2013).
What of behaviors that have no ostensive function (e.g., eating an apple) or even aspects of our environment that might have been modified by others (e.g., a book found on the coffee table)? Although such stimuli should not trigger epistemic vigilance by default, they may under some circumstances. One might interpret a friend eating an apple as an indication that the friend has followed health advice to eat more fruit, or one could interpret one’s spouse’s placement of a book on a table as an invitation to read it—whether it was so intended or not. The behavior might then be discounted: We might suspect our friend of eating the apple only for our benefit while privately gorging on junk food.
Other cognitive mechanisms, more akin to strategic reasoning, but bound to overlap with epistemic vigilance, must process noncommunicative yet manipulative information (on the definition of communication vs. manipulation or coercion, see Scott-Phillips, 2008). A detective should be aware that some clues might have been placed by the criminal to mislead her. In some circumstances, therefore, epistemic vigilance and related mechanisms might apply even to our material environments, instead of applying only to straightforward cases of testimony. Still, epistemic vigilance should always apply to testimony, whereas it should apply to perception only under specific circumstances, such that the distinction between these two domains (testimony vs. perception) remains a useful heuristic.
How might these considerations inform our understanding of delusions? Whereas in healthy individuals the scales are adaptively tipped in favor of trusting the perceptual over the ostensive, this imbalance may be maladaptively exacerbated in delusions (Fig. 1). This could be for at least two complementary reasons: Sensory or perceptual evidence may be overweighted, and testimonial evidence may be underweighted. We review each of these possibilities in turn.

Thursday, March 9, 2023

There is a positive relationship between intelligence and survival; intelligence is a protective factor for reaching upper-middle age, thereafter survival depends less on intelligence and more on other factors

Intelligence and life expectancy in late adulthood: A meta-analysis- Macarena Sánchez-Izquierdo et al. Intelligence, Volume 98, May–June 2023, 101738. Mar 2023. https://doi.org/10.1016/j.intell.2023.101738

Highlights

• There is a positive relationship between intelligence and survival.

• The most robust moderator is years of follow-up.

• Intelligence is a protective factor for reaching upper-middle age, thereafter survival depends less on intelligence and more on other factors.

Abstract: In an aging society, it is crucial to understand why some people live long and others do not. There has been a proliferation of studies in recent years that highlight the importance of psycho-behavioural factors in the ways of aging, one of those psychological components is intelligence. In this meta-analysis, the association between intelligence and life expectancy in late adulthood is analysed through the Hazard Ratio (HR). Our objectives are: (i) to update Calvin's meta-analysis, especially the estimate of the association between survival and intelligence; and (ii) to evaluate the role of some moderators, especially the age of the participants, to explore intelligence–mortality throughout adulthood and old age. The results show a positive relationship between intelligence and survival (HR•: 0.79; 95% CI: 0.81–0.76). This association is significantly moderated by the years of follow-up, the effect size being smaller the more years elapse between the intelligence assessment and the recording of the outcome. Intelligence is a protective factor to reach middle-high age, but from then on survival depends less and less on intelligence and more on other factors.

Keywords: IntelligenceMortalitymeta-analysisSystematic review

4. Discussion

>20 longitudinal studies from several countries (Australia, Denmark, Luxembourg, Sweden, United Kingdom, and USA) have demonstrated the link between higher intelligence and longer life. This gave rise to the field of cognitive epidemiology, which focuses on understanding the relationship between cognitive functioning and health.

This study aimed to update Calvin's meta-analysis, confirm the quantification of this association and also analyse whether the intelligence–mortality association varies across adulthood and old age. We found evidence that having intelligence of at least 1-SD above the mean seems to reduce the mortality rate, although our rate was a little lower (21.6%) than that of Calvin's meta-analysis (24%).

Another objective of the study was to analyse the influence of several factors as possible moderators, specifically bias, age, and sex. Our results showed that recent studies tend to find a weaker association between intelligence and mortality than older studies. Along this line, Calvin and colleagues have shown a trend for larger cohorts accumulating in more recent years (Calvin et al., 2011).

The average life expectancy of women exceeds that of men, however, sex does not moderate the association between intelligence and mortality, being the same for men and women, as previously reported in twin longitudinal studies (Arden et al., 2016) and Calvin et al. (2011).

The question “What causes the relationship between intelligence and longevity/ mortality?” remains unsolved and crucial. Factors such as childhood environment, family income, schooling, and healthy/unhealthy lifestyle habits (diet, exercise, tobacco use, alcohol, illnesses), have been studied (Deary, Weiss, & Batty, 2010Whalley & Deary, 2001). As Deary et al. (2021) suggested, there seems to be a reciprocal dynamic association between intelligence and health throughout life, and although there are several constructs associated with health/ illness and death (e.g., parental social class, intelligence in youth, more education, higher health literacy, healthy behaviors, and more affluent social class) shared genetic differences are likely to account for only a small proportion of these associations.

In this study we wanted to re-evaluate the influence of Socio-economic status as a predictor of mortality; our results showed that childhood SES did not moderate the potential of intelligence for predicting mortality. Although several studies (Batty et al., 2007Hemmingsson, Melin, Allebeck, & Lundberg, 2009) suggested that intelligence had effects on the risk of mortality independent from those of early socio-economic influences, other studies suggested that SES was not a confounder of the intelligence–mortality association (Calvin et al., 2011Calvin et al., 2017). Furthermore, a study of over 900 Scottish participants (Hart et al., 2003) found that statistically controlling for economic class and a measure of “deprivation” reflecting unemployment, overcrowding, and other adverse living conditions accounted for only about 30% of the IQ-mortality correlation.

Along this line, Gottfredson (Gottfredson, 2004) argued that underlying IQ differences explained social inequalities in health and that these were not necessarily mediated via adult/person's-own SES. This idea was tested by Batty, Der, Macintyre, and Deary (2006) who found that IQ does not completely explain socioeconomic inequalities in health, however, it might contribute to them through a variety of processes.

Another line of research suggested that genes may contribute to the link between IQ and mortality. Arden and colleagues (Arden et al., 2016) analysed three twin studies (from the U.S., Denmark, and Sweden) and found a small positive phenotypic correlation between intelligence and lifespan, furthermore, in the combined sample, the genetic contribution to covariance was 95%; in the US study, 84%; in the Swedish study, 86%, and in the Danish study, 85%. As the authors highlighted, any genetic factors that contribute to intelligence and mortality may operate indirectly via good health choices or higher income which leads to better healthcare. Deary, Harris, and Hill (2019)Deary et al. (2021)) reviewed the genetics through genome-wide association studies (GWASs), Genome-wide complex trait analysis (GREML), and LD regression studies, which allowed them to estimate genetic correlations between phenotypes (intelligence and health).

The second question of this study: “Does the relationship between intelligence and mortality change in the older adults?” Yes, that relationship changes when the most long-lived studies were compared with the youngest studies. As several studies have suggested (Arden et al., 2016; Hart et al., 2005), the causes of the association between intelligence and lifespan may vary between ages. Childhood IQ has been related to mortality in Scottish populations: Hart et al., (2005) showed that childhood IQ was significantly related to deaths occurring up to age 65, but not to deaths occurring after age 65, whereas the Aberdeen study found that people with a lower IQ were less likely to be alive at age 76 (Whalley & Deary, 2001).

Analysing whether the relationship between intelligence and mortality changes in the older adults, our results showed a small but significant positive slope for FUY, which reflected that the association was slightly smaller as more years elapsed between time 1 (intelligence assessment) and time 2 (check for survival). This means that the relationship between intelligence and survival is dampened. Our findings confirm results from the Midspan studies (Hart et al., 2005). As suggested by several studies, one possible reason might be that higher IQ might be associated with better healthcare and engaging in healthier behaviors (Deary et al., 2019; Hart et al., 2005; Wraw, Der, Gale, & Deary, 2018), which is associated to a lower mortality risk (Gottfredson, 2004Gottfredson & Deary, 2004). IQ might also predispose to conditions of adult life (Marmot & Kivimäki, 2009), quitting smoking in later life (Batty et al., 2007Daly & Egan, 2017), and entry into safer environments (Whalley & Deary, 2001), which promote staying healthier and living longer.

One way of discovering why intelligence and mortality are related and why this association seems to be smaller at higher ages might be to review the specific causes of death to which intelligence relates from childhood and adulthood. Along this line, several studies have shown its association with most of the major causes of death. The main literature has reported inverse patterns of the association between childhood intelligence and respiratory disease (Batty, Deary, & Zaninotto, 2016Calvin et al., 2017), coronary heart disease (e.g., Calvin et al., 2017Christensen, Mortensen, Christensen, & Osler, 2016Hart et al., 2004Lawlor, Ronalds, Clark, Davey Smith, & Leon, 2005;), stroke (Calvin et al., 2017), total cardiovascular disease (Batty et al., 2016Calvin et al., 2017Christensen et al., 2016Hart et al., 2003Hart et al., 2004Hemmingsson, Melin, Allebeck, & Lundberg, 2006Leon, Lawlor, Clark, Batty, & Macintyre, 2009), digestive disease (Calvin et al., 2017), cancer (Batty et al., 2009Batty et al., 2016Leon et al., 2009), specifically with smoking-related diseases (Calvin et al., 2017), dementia (Calvin et al., 2017Russ et al., 2013), and suicide (Hemmingsson et al., 2006,

Deary et al. (2021) presented consistent results showing intelligence associated with several causes of death (cardiovascular disease, coronary heart disease, stroke, respiratory disease, diabetes, digestive disease, dementia, non-smoking-related cancers, accidents and suicide), illnesses (hypertension, metabolic syndrome, diabetes, schizophrenia, and major depression), health biomarkers (e.g. systolic and diastolic blood pressure, heart rate, triglycerides and cholesterol, body mass index), and health behaviors (smoking and physical inactivity). As the authors highlight, intelligence's long-term association with health is mediated via adult social factors and health behaviors.

5. Limitations

The present meta-analysis includes large published studies representing in total >47,000 average sample size. However, it includes 22 studies: 16 studies were already included in the previous meta-analysis (Calvin et al., 2011) and six new articles.

Although all studies were adjusted for multiple potential moderators, there are likely to remain different factors, such as SES in adulthood, cause of death in the intelligence–mortality association, etc., that could substantially affect the results.

For those reasons, the combined models are not strictly comparable, since other moderators are frequently added to childhood SES and it is not possible to disentangle their effects in the meta-analysis. Although we are aware of this potential weakness, we have preferred to perform and report this analysis. If we had found an effect, it would have been difficult to interpret, but we did not any effect of this moderator. As might be expected, the inclusion of moderators leads to a significant increase in heterogeneity, which we may interpret to mean that some of the moderators increase the effect and others reduce it, but we cannot identify the role each of them plays at the meta-analytic level.

Also, future research should explore mediating effects on a pathway from premorbid intelligence to the risk of mortality, taking into account common genetic effects (e.g. with GWAS) and the role of socioeconomic status, health literacy, and adult environments and behaviors.

It should also be important to include other countries and cultures in the studies.


Liberalizing prostitution leads to a significant decrease in rape rates, while prohibiting it leads to a significant increase (much greater than liberalization's effect)

Do Prostitution Laws Affect Rape Rates? Evidence from Europe. Huasheng Gao and Vanya Petrova. The Journal of Law and Economics, Volume 65, Number 4, Mar 2023. https://www.journals.uchicago.edu/doi/epdf/10.1086/720583

Abstract: We identify a causal effect of the liberalization and prohibition of commercial sex on rape rates, using staggered legislative changes in European countries. Liberalizing prostitution leads to a significant decrease in rape rates, while prohibiting it leads to a significant increase. The results are stronger when rape is less severely underreported and when it is more difficult for men to obtain sex via marriage or partnership. We also provide the first evidence for the asymmetric effect of prostitution regulation on rape rates: the magnitude of prostitution prohibition is much larger than that of prostitution liberalization. Placebo tests show that prostitution laws have no impact on nonsexual crimes. Overall, our results indicate that prostitution is a substitute for sexual violence and that the recent global trend of prohibiting commercial sex (especially the Nordic model) could have the unforeseen consequence of proliferating sexual violence.

        If you expel prostitutes from society, you will unsettle everything on account of lusts. (St. Augustine, in Richards 1995, p. 118)1


Wednesday, March 8, 2023

Personality disorders reveal much stronger sex differences than normal personality traits, with men leaning much more towards the "dark" side

Sex in the dark: Sex differences on three measures of dark side personality. Adrian Furnham, George Horne. Acta Psychologica, Volume 234, April 2023, 103876. https://doi.org/10.1016/j.actpsy.2023.103876

Abstract: This study examined sex differences in the scores on three different measures of the personality disorders (PDs) all derived from on-line surveys. Two groups (total N = 871) completed the Coolidge Axis-II Inventory which assessed 14 PDs; two groups (total N = 732) completed the Short Dark Tetrad which assessed 4 PDs; four groups (total N = 1558) completed the Personality Inventory for DSM-5—Brief Form which assessed 5 PD dimensions. Cohen's d after ANOVAs, and binary regression analysis revealed consistent findings. In this study we calculated 63 d statistics of which 5 were d > 0.50 and 28 were d > 0.20. In two samples, each using two different instruments, men scored higher than women on Anti-Social, Narcissistic and Sadistic PD which is a consistent finding in the literature. Speculations are made about the origin of these differences. Limitations are acknowledged.

Keywords: SexPersonalityTraitsDisordersEffect sizeBright/dark side

4. Discussion

The results of this study are largely consistent with previous research in this area, and confirms they hypotheses. This paper raises a number of points. First, the consistency of the PD sex differences across samples who took the same test, and second across PDs measured between different tests. With regard to the consistency between samples there seemed “reasonable” agreement particularly with those that were most and least significantly different. In all, we had eight participant groups with an 232 < N < 506 who were recruited on-line. In no instance did analyses show opposite results with the exception of one group tested on the DSM-5 where men scored higher than women on the Negative Affectively scale in contrast to the other three groups. Thus, we have demonstrated the generalisability of results across very different measures, in eight different samples.

A major question concerns sex differences in “bright-” as opposed to “dark-side” traits. Furnham and Treglown (2021) who looked at six tests found the Cohen's d statistic showed very few (3 out of 130) differences >0.50. In a study of dark-side traits, Furnham and Grover, 2022aFurnham and Grover, 2022b found a Cohen's d statistic showed very few (5 out of 44) differences >0.20. In this study however we calculated 63 d statistics of which 5 were d > 0.50 and 28 were d > 0.20. Thus, it appears there are more differences on dark-, as opposed to bright-side measures. This finding requires an explanation and further investigations. However, we have to acknowledge that overall, there are both relatively few and small sex differences, an observation made by many in this area.

We were also able to compare sex differences on different measures of the same trait as the SCATI and the Dark Tetrad both measured Anti-Social, Narcissistic and Sadistic PD. This was consistent between the samples and the instruments showing the following d scores: Anti-Social: 0.30, 0.28, 63, 0.69; Narcissistic: 0.26, 0.84, 0.32, 0.83 and Sadistic PD 0.41; 0.25, 0.80, 0.49. These results confirm the previous literature on Anti-Social and Narcissistic PD but highlight the role of Sadistic PD which, admittedly does not appear as a PD in any of the DSM manuals (American Psychiatric Association, 2000American Psychiatric Association, 2015). It explains also why so many studies on powerful derailed individuals nearly always highlight men rather than women (Babiak & Hare, 2006).

An examination of the Binary Logistic Regressions showed that the Exp(B) varied mainly between 0.80 and 1.20 the lowest being 0.63 for Antagonistic for Group 1 and the highest being 1.68 for Negative Effect for Group 1. Again, depending on cut-interpretations these could be considered high or low.

However, it does appear from this data that having a PD is predominantly a “male problem” in that on all four Tetrad traits, and four of the five DSM-5 dimensions males scored significantly higher than females across all samples. The SCATI did show that where there were consistent findings across the two samples and a d > 0.10 women did score higher on Borderline, Dependent, and Schizotypal, which has been established in previous studies.

There appears to be relatively little theoretical development in the PD literature about the “causes” of the different PDs that may lead to very clear hypothesis testing. Whilst it would not be difficult to develop an evolutionary-based theory explaining why men might be higher on Anti-Social and Narcissistic PD it seems much more difficult to explain why women might score more highly on other PDs like Borderline or Schizotypal. In this sense few of the sex difference studies in PD are theoretically, rather than psychometrically, driven.

5. Conclusion

The strengths of this paper was to report sex difference in the PDs using multiple measures (three) and multiple samples (eight). The results suggest that compared to studies of sex differences in bright-side (normal) personality where sex differences are common but small, sex differences in (some) dark-side traits are consistently larger.

This study has implications for the theory, measurement and indeed treatment of the PDs. From an evolutionary psychology perspective it seems possible to explain some of these differences: for instance, why boldness, fearlessness and self-confidence maybe beneficial to males, though in excess a disadvantage. Equally it may be possible to explain some sex differences in the traditional socialisation of children into established sex roles.

As the movement of PD researchers from a categorical to a dimensional perspective progresses we should be able to inspect sex differences seeking to establish consistency of findings and following that explanations.

6. Limitations

There are frequent critiques that online survey data is often problematic with participants being perfunctory in their responses. In each study we included IQ items as well as other checks to be able to inspect the quality of the responses. In most studies we removed a small number of participants before the analysis with concerns about the quality of their data.

This study explored the data bank of a research group. Nearly all the participants were functioning working adults and not a student or clinical sample, though it is possible that a small number were present in each study. Although we had a lot of data on each participant it was not consistent between samples. Furthermore, it would have been desirable to have a lot more data on each person such as their education and general mental health.

Of the two categorical measures the SCATI has been used in a number of studies and appears to have adequate psychometric properties but is not a particularly well-known measure. The Dark Tetrad measure on the other hand is relatively new though attracting a good deal of attention (Alavi et al., 2022Fernández-del-Río et al., 2022Jain et al., 2022). However, the DSM-5 is now 10 years old and has been used in many studies. Studies such as this serve to describe sex differences but give no indication in their cause or consequence. Thus, they can show that differences exist but not why.


Income and emotional well-being

Income and emotional well-being: A conflict resolved. Matthew A. Killingsworth, Daniel Kahneman, and Barbara Mellers. Proc. Natl. Acad. Sci., March 1 2023, 120 (10) e2208661120. https://doi.org/10.1073/pnas.2208661120

Significance: Measures of well-being have often been found to rise with log (income). Kahneman and Deaton [Proc. Natl. Acad. Sci. U.S.A. 107, 16489–93 (2010)] reported an exception; a measure of emotional well-being (happiness) increased but then flattened somewhere between $60,000 and $90,000. In contrast, Killingsworth [Proc. Natl. Acad. Sci. U.S.A. 118, e2016976118 (2021)] observed a linear relation between happiness and log(income) in an experience-sampling study. We discovered in a joint reanalysis of the experience sampling data that the flattening pattern exists but is restricted to the least happy 20% of the population, and that complementary nonlinearities contribute to the overall linear-log relationship between happiness and income. We trace the discrepant results to the authors’ reliance on standard practices and assumptions of data analysis that should be questioned more often, although they are standard in social science.

Abstract: Do larger incomes make people happier? Two authors of the present paper have published contradictory answers. Using dichotomous questions about the preceding day, [Kahneman and Deaton, Proc. Natl. Acad. Sci. U.S.A. 107, 16489–16493 (2010)] reported a flattening pattern: happiness increased steadily with log(income) up to a threshold and then plateaued. Using experience sampling with a continuous scale, [Killingsworth, Proc. Natl. Acad. Sci. U.S.A. 118, e2016976118 (2021)] reported a linear-log pattern in which average happiness rose consistently with log(income). We engaged in an adversarial collaboration to search for a coherent interpretation of both studies. A reanalysis of Killingsworth’s experienced sampling data confirmed the flattening pattern only for the least happy people. Happiness increases steadily with log(income) among happier people, and even accelerates in the happiest group. Complementary nonlinearities contribute to the overall linear-log relationship. We then explain why Kahneman and Deaton overstated the flattening pattern and why Killingsworth failed to find it. We suggest that Kahneman and Deaton might have reached the correct conclusion if they had described their results in terms of unhappiness rather than happiness; their measures could not discriminate among degrees of happiness because of a ceiling effect. The authors of both studies failed to anticipate that increased income is associated with systematic changes in the shape of the happiness distribution. The mislabeling of the dependent variable and the incorrect assumption of homogeneity were consequences of practices that are standard in social science but should be questioned more often. We flag the benefits of adversarial collaboration.