Saturday, March 25, 2023

Replication failure... No effects of exposure to women's fertile window body scents on men's hormonal and psychological responses

No effects of exposure to women's fertile window body scents on men's hormonal and psychological responses. James R. Roney et al. Evolution and Human Behavior, March 22 2023. https://doi.org/10.1016/j.evolhumbehav.2023.03.003

Abstract: Do men respond to women's peri-ovulatory body odors in functional ways? Prior studies reported more positive changes in men's testosterone and cortisol after exposure to women's scents collected within the putative fertile window (i.e., cycle days when conception is possible) compared to comparison odors, and also psychological priming effects that were differentially larger in response to the fertile window odors. We tested replication of these patterns in a study with precise estimation of women's ovulatory timing. Both axillary and genital scent samples were collected from undergraduate women on six nights spaced five days apart. Here, we tested men's responses to a subset of these samples that were chosen strategically to represent three cycle regions from each of 28 women with confirmed ovulation: the follicular phase prior to the start of the fertile window, the fertile window, and the luteal phase. A final sample of 182 men were randomly assigned to each smell one scent sample or plain water. Saliva samples were collected before and after smelling to assess changes in testosterone and cortisol, and psychological measures of both sexual priming and social approach motivation were assessed after stimulus exposure. Planned comparisons of fertile window to other stimuli revealed no statistically significant effects for any dependent variable, in spite of sufficient power to detect effect sizes reported in prior studies. Our findings thus failed to replicate prior publications that showed potentially adaptive responses to women's ovulatory odors. Discussion addresses the implications of these findings for the broader question of concealed ovulation in humans.

Keywords: Scent attractiveness, Concealed ovulation, Testosterone, Cortisol, Human mating

4. Discussion

As a general summary, we found no compelling evidence that men exhibit differential hormonal or psychological responses to women's body odors collected near ovulation relative to their responses to body odors from other cycle regions (or to plain water). The nonsignificant findings occurred despite our study having sufficient power to detect effect sizes that have been reported in the prior literature. Furthermore, Bayes factors computed for each of our dependent variables suggested that the observed data were about 10 to 13 times more likely under null models than under models including the fertile window contrast, and Bayes factors in this range have been argued to provide strong evidence in favor of the null hypothesis (see Schonbrodt & Wagenmakers, 2018).
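To make the Bayes factor range above concrete, here is a minimal sketch (not part of the paper) that converts a Bayes factor favoring the null, BF01, into a posterior probability for the null model. The equal prior odds (prior_null = 0.5) and the function name are illustrative assumptions; the paper reports only the Bayes factors themselves.

```python
def posterior_null_probability(bf01: float, prior_null: float = 0.5) -> float:
    """Posterior probability of the null model, given BF01 = p(data | H0) / p(data | H1).

    prior_null is an assumed prior probability for H0 (0.5 means equal prior odds);
    it is an illustrative choice, not a value taken from the paper.
    """
    prior_odds = prior_null / (1.0 - prior_null)
    posterior_odds = bf01 * prior_odds  # Bayes' rule in odds form
    return posterior_odds / (1.0 + posterior_odds)


# The two ends of the range mentioned in the text (about 10 to 13).
for bf01 in (10.0, 13.0):
    print(f"BF01 = {bf01:g}: P(H0 | data) = {posterior_null_probability(bf01):.3f}")
```

Under the assumed equal prior odds, a BF01 of 10 corresponds to a posterior probability of 10/11 for the null model; a different prior would shift that number without changing the Bayes factor.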

4.1. Hormone responses to scent stimuli

Our results did not replicate prior findings of more positive testosterone or cortisol changes after exposure to scents collected near ovulation relative to comparison scents (Cerda-Molina et al., 2013; Miller & Maner, 2010). Our findings for testosterone were more similar to those of Roney and Simmons (2012), who reported no significant differences in hormone changes after exposure to peri-ovulatory scents vs. after exposure to plain water. Among prior studies, only Cerda-Molina et al. (2013) measured differential cortisol responses to women's peri-ovulatory body odors, and they reported a complex pattern whereby cortisol rose above basal concentrations at 15 min. post-exposure for peri-ovulatory stimuli and at 30 min. post-exposure for luteal vulvar stimuli, but fell below baseline concentrations for luteal axillary stimuli at 15 and 60 min. post-exposure. Our results at 15 min. post-exposure did not replicate those patterns.

What may account for differences between results of the current study and those of prior studies that have reported significant hormone responses to women's peri-ovulatory body scents? Miller and Maner (2010) used whole T-shirts as scent stimuli as opposed to our use of gauze pads; although Cerda-Molina et al. (2013) employed similar collection methods to those in the present study, we cannot rule out the possibility that hormone responses may be more reliable in response to shirt stimuli. A possible limitation of our method was the longer time that we stored frozen samples before use in testing (up to a year, as opposed to samples being used within a week in Miller and Maner (2010) and Cerda-Molina et al. (2013)), although studies that have varied length of storage have provided evidence that responses to human body scents are not affected by long freezing times (Gomes et al., 2020; Lenochova et al., 2009). We estimated ovulatory timing more precisely via use of LH tests than did Miller and Maner (2010) who used highly error-prone counting methods (see Gangestad et al., 2016), and this should have increased our probability of finding true effects. Cerda-Molina et al. (2013) cited two factors that might explain discrepancies between their results and the null effects reported by Roney and Simmons (2012)—the longer stimulus collection time in their study and evidence that men in their study were aware that they were smelling women—but both of these differences were eliminated in the present study in which women collected scents overnight and male participants were explicitly told that they were smelling odors from women.

A salient difference between our methods and those of Cerda-Molina et al. (2013) was their use of a nebulizer containing scent stimuli (or plain air) in order to forcefully project odors into participants' nasal passages. It is possible that this method produces hormone responses in perceivers that are absent after taking deep sniffs from jars containing scent stimuli. The ecological validity of the nebulizer delivery method is uncertain. On the one hand, it may deliver stimuli of supra-normal intensity that are not encountered under real-world conditions. On the other hand, it is possible that this method approximates the greater intensity of odor exposure that might occur during some forms of sexual contact. In any case, this difference in scent delivery method presents a possible reason for the discrepancy in findings across the two studies.

It is also possible that prior positive findings for men's hormone responses to women's peri-ovulatory body odors were false positive results. The patterns described in Cerda-Molina et al. (2013) were particularly striking in that men generally responded with testosterone increases after smelling peri-ovulatory stimuli but testosterone decreases after exposure to luteal stimuli. That pattern suggested that men's hormone responses might be strong enough that they could accurately diagnose women's ovulatory timing from scent cues alone. The current findings shed at least some doubt on the robustness of those findings. Future research would ideally provide additional evidence.

4.2. Psychological responses to scent stimuli

Our results also failed to replicate prior findings suggesting the priming of sexual concepts after exposure to peri-ovulatory scent stimuli relative to comparison stimuli. For two dependent variables, we employed measures verbatim from Miller and Maner (2011): the word stem completion task, and a measure of attribution of sexual arousal to the scent donor. A difference in data analysis between studies was the addition of the Chemical Sensitivity Scale (CSS; Nordin, Millqvist, Lowhagen, & Bende, 2003) to the data analyses in Miller and Maner (2011). The scale measures participants' conscious awareness of odors in their environment. For the word stem task, Miller and Maner (2011) added controls for main and interaction effects for scores on this scale in the model testing effects of scent exposure condition. For the sexual arousal attribution task, they reported no main effect of scent exposure condition but a significant interaction between scent condition and CSS scores such that only among men with high smell sensitivity was greater sexual arousal attributed to the peri-ovulatory scents relative to luteal scents. We did not administer this scale, and this difference in method could help to explain discrepant findings for these variables. However, simulation data show that the addition of covariates and testing for interactions with individual difference variables are practices that can inflate type I error rates (Simmons, Nelson, & Simonsohn, 2011), which adds some doubt to the positive findings for the word stem and arousal attribution variables. Furthermore, if sexual priming effects were specialized adaptations for responding to cues of women's ovulatory timing, one would not expect their expression to be restricted only to men with highly sensitive senses of smell. 
Thus, the overall data pertaining to these variables—including the non-significant findings in the present study—appear to provide weak evidence for adaptations that produce sexual priming effects in response to ovulatory scent cues.

As a more direct measure of sexual priming, we also queried how much sexual desire men felt after exposure to scent cues. There were no significant effects of cycle phase for this variable (see Table 1 and Fig. 4c). Cerda-Molina et al. (2013) administered an “interest in sex” scale and reported higher scores after exposure to peri-ovulatory scent stimuli, but the scale was quite heterogeneous and included trait-like items (e.g., “[how high do] you think that your sexual desire normally is?”) in addition to measures of current states. As with the hormone responses, it is possible that a nebulizer delivery of scents would produce stronger fertile window effects on men's self-reported sexual desire than those found here.

We did find a main effect of stimulus type on sexual desire such that men who smelled the armpit stimuli reported higher desire than those who smelled the pantyliner stimuli. Additional data analyses supported subjective scent attractiveness ratings as mediating this effect of stimulus type. The positive correlation between scent attractiveness ratings and sexual desire supports the possibility that desire responds to odor attractiveness in general even if it does not respond reliably to scents produced during the fertile window. Odor attractiveness may be related to variables like health (e.g., Olsson et al., 2014) or immune compatibility (e.g., Thornhill et al., 2003), and thus responding to it with desire may have functions aside from ovulation detection.

We also administered a custom social approach motivation scale but scores on it were not differentially higher after exposure to scents from the fertile window (see Table 1 and Fig. 4d). Tan and Goldman (2015) used an indirect behavioral measure to provide evidence that men exposed to peri-ovulatory scents were motivated to sit closer to women, and it is possible that our findings would have differed with such a measure. Oren and Shimone-Tsoory (2019) provided evidence that single but not paired men exhibited greater social perception abilities after exposure to peri-ovulatory scents. Although we did not measure social perception, we did assess the possible moderating influence of relationship status for the effects of cycle phase on our dependent measures, in part motivated by the findings of the social perception study. Results presented in SOM provide no compelling evidence that hormonal or psychological responses to fertile window stimuli were consistent with prior positive findings in the subset of single men.

4.3. Implications for concealed ovulatory timing

Our findings argue against the possibility that human ovulatory timing is detectable from body odors. Mei et al. (2022) recently used signal detection analyses to show that increased scent attractiveness during the fertile window was not substantial enough to reliably diagnose ovulatory timing. That finding left open the possibility that diagnostic cues of ovulatory timing might be revealed via adaptive patterns of responses to scents, such as reactive hormone changes. The present results failed to detect any such putatively adaptive responses, however, and thus argue against that possibility.

The present study addressed only odor cues of ovulatory timing. Cues from other sensory modalities could in principle provide more information, or a combination of cues across modalities could prove more diagnostic. With respect to the latter possibility, Miller and Maner (2011) provided evidence that men who interacted in person with a woman confederate were more likely to mimic her movements and to increase their risk-taking when her estimated conception risk was higher. Perhaps in cases like that, a combination of odor, voice, face, and behavioral cues might more accurately cue fertile window timing.

The strongest tests of multi-modal cuing of ovulatory timing should in principle come from studies that measure the responses of women's long-term romantic partners to the women's cycle phases. Such partners should have the most intimate and detailed information regarding changes in any perceptible stimuli, and would also have clear functional reasons to respond to cues of ovulatory timing for the purpose of ensuring paternity confidence. A recent study of nearly 400 couples with preregistered data analyses and many thousands of observations found no significant effects of women's estimated fertile window timing on male partners' ratings of the women's attractiveness, sexual desire for their partners, feelings of jealousy, or levels of attention to and desire to have contact with the women (Schleifenbaum et al., 2022). Those findings corroborate earlier studies that have generally found that men's rates of sexual initiation are flat across phases of their partners' menstrual cycles (Adams, Gold, & Burt, 1978; Caruso et al., 2014; Van Goozen, Wiegant, Endert, Helmond, & VandePoll, 1997; cf. Harvey, 1987). Likewise, and pertinent to the hormonal responses tested in the current study, studies have failed to find significant shifts in men's testosterone concentrations across different phases of their romantic partners' menstrual cycles (Ström, Ingberg, Druvefors, Theodorsson, & Theodorsson, 2012; Ström, Ingberg, Slezak, Theodorsson, & Theodorsson, 2018). Collectively, these patterns are unexpected if women's body odors provide diagnostic information regarding their ovulatory timing, or if multi-modal stimulus cues jointly reveal fertile window timing.

50-70% of all dreams include residue from the previous day, especially in the early stages of sleep, while later stages refer to more distant memories

Memory reactivations during sleep: a neural basis of dream experiences? Claudia Picard-Deland et al. Trends in Cognitive Sciences, March 22 2023. https://doi.org/10.1016/j.tics.2023.02.006


Abstract: Newly encoded memory traces are spontaneously reactivated during sleep. Since their discovery in the 1990s, these memory reactivations have been discussed as a potential neural basis for dream experiences. New results from animal and human research, as well as from the rapidly growing field of sleep and dream engineering, provide essential insights into this question, and reveal both strong parallels and disparities between the two phenomena. We suggest that, although memory reactivations may contribute to subjective experiences across different states of consciousness, they are not likely to be the primary neural basis of dreaming. We identify important limitations in current research paradigms and suggest novel strategies to address this question empirically.


Systematic review of all published fMRI research on psychopathy: No reproducible evidence suggests that psychopathy is associated with a functional neurobiological profile

Jalava, J., Griffiths, S., & Larsen, R. R. (2023). How to keep unreproducible neuroimaging evidence out of court: A case study in fMRI and psychopathy. Psychology, Public Policy, and Law, 29(1), 1–18. Feb 2023. https://doi.org/10.1037/law0000383

Abstract: The amount of neuroimaging evidence introduced in courts continues to increase. Meanwhile, neuroimaging research is in the midst of a reproducibility crisis, as many published findings appear to be false positives. The problem is mostly due to small sample sizes, lack of direct replications, and questionable research practices. There are concerns that a significant proportion of neuroimaging evidence introduced in court may therefore be unreliable. Guidelines governing the admissibility of scientific evidence—Frye and Daubert—are not designed to weed out such data. We propose supplementing Frye and Daubert with minimal reproducibility criteria that allow judges to make informed admissibility decisions about neuroimaging research. To demonstrate how this could work, we subjected functional magnetic resonance imaging (fMRI) findings on psychopathy—evidence that has been admitted in court—to a minimal reproducibility test. A systematic PRISMA search found 64 relevant studies but no sufficiently powered, directly replicated evidence of a psychopathy-related neurobiological profile. This illustrates two things: (a) the probability of false positives in this data set is likely to be unacceptably high and (b) the reproducibility of similar neuroimaging evidence can be evaluated in a straightforward way. Our findings suggest an urgent need to modify admissibility guidelines to exclude low-quality neuroimaging data.

Check also Is the Psychopathic Brain an Artifact of Coding Bias? A Systematic Review. Jarkko Jalava et al. Front. Psychol., April 12 2021. https://www.bipartisanalliance.com/2021/04/is-psychopathic-brain-artifact-of.html

Friday, March 24, 2023

People may not be able to tell if they are envied by another person at a particular moment, but they know who the notoriously envious ones are among the people they have known for a longer time

Lange, Jens, Birk Hagemeyer, Thomas Lösch, and Katrin Rentzsch. 2019. “Accuracy and Bias in the Social Perception of Envy.” OSF Preprints. June 16. doi:10.31219/osf.io/8jc7x

Abstract: Research converges on the notion that when people feel envy, they disguise it towards others. This implies that a person’s envy in a given situation cannot be accurately perceived by peers, as envy lacks a specific display that could be used as a perceptual cue. In contrast to this reasoning, research supports that envy contributes to the regulation of status hierarchies. If envy threatens status positions, people should be highly attentive to identify enviers. The combination of the two led us to expect that (a) state envy is difficult to accurately perceive in unacquainted persons and (b) dispositional enviers can be accurately identified by acquaintances. To investigate these hypotheses, we used actor-partner interdependence models to disentangle accuracy and bias in the perception of state and trait envy. In Study 1, 436 unacquainted dyad members competed against each other and rated their own and the partner’s state envy. Perception bias was significantly positive, yet perception accuracy was non-significant. In Study 2, 502 acquainted dyad members rated their own and the partner’s dispositional benign and malicious envy as well as trait authentic and hubristic pride. Accuracy coefficients were positive for dispositional benign and malicious envy and robust when controlling for trait authentic and hubristic pride. Moreover, accuracy for dispositional benign envy increased with the depth of the relationship. We conclude that enviers might be identifiable but only after extended contact and discuss how this contributes to research on the ambiguous experience of being envied.


Whether intelligence can be achieved without any agency or intrinsic motivation is an important philosophical question; equipping LLMs with agency & intrinsic motivation is a fascinating & important direction for future work

Sparks of Artificial General Intelligence: Early experiments with GPT-4. Sebastien Bubeck et al. Mar 22 2023. https://arxiv.org/pdf/2303.12712.pdf

Abstract: Artificial intelligence (AI) researchers have been developing and refining large language models (LLMs) that exhibit remarkable capabilities across a variety of domains and tasks, challenging our understanding of learning and cognition. The latest model developed by OpenAI, GPT-4 [Ope23], was trained using an unprecedented scale of compute and data. In this paper, we report on our investigation of an early version of GPT-4, when it was still in active development by OpenAI. We contend that (this early version of) GPT4 is part of a new cohort of LLMs (along with ChatGPT and Google’s PaLM for example) that exhibit more general intelligence than previous AI models. We discuss the rising capabilities and implications of these models. We demonstrate that, beyond its mastery of language, GPT-4 can solve novel and difficult tasks that span mathematics, coding, vision, medicine, law, psychology and more, without needing any special prompting. Moreover, in all of these tasks, GPT-4’s performance is strikingly close to human-level performance, and often vastly surpasses prior models such as ChatGPT. Given the breadth and depth of GPT-4’s capabilities, we believe that it could reasonably be viewed as an early (yet still incomplete) version of an artificial general intelligence (AGI) system. In our exploration of GPT-4, we put special emphasis on discovering its limitations, and we discuss the challenges ahead for advancing towards deeper and more comprehensive versions of AGI, including the possible need for pursuing a new paradigm that moves beyond next-word prediction. We conclude with reflections on societal influences of the recent technological leap and future research directions.

---
For example, whether intelligence can be achieved without any agency or intrinsic motivation is an important philosophical question. Equipping LLMs with agency and intrinsic motivation is a fascinating and important direction for future work. With this direction of work, great care would have to be taken on alignment and safety per a system’s abilities to take autonomous actions in the world and to perform autonomous self-improvement via cycles of learning. We discuss a few other crucial missing components of LLMs next.

Thursday, March 23, 2023

Experimental evidence that core intertemporal choice anomalies—like extreme short-run impatience, structural estimates of present bias, hyperbolicity & transitivity violations—are driven by complexity rather than time or risk preferences

Complexity and Time. Benjamin Enke, Thomas Graeber & Ryan Oprea. NBER Working Paper 31047. Mar 2023. DOI 10.3386/w31047

Abstract: We provide experimental evidence that core intertemporal choice anomalies -- including extreme short-run impatience, structural estimates of present bias, hyperbolicity and transitivity violations -- are driven by complexity rather than time or risk preferences. First, all anomalies also arise in structurally similar atemporal decision problems involving valuation of iteratively discounted (but immediately paid) rewards. These computational errors are strongly predictive of intertemporal decisions. Second, intertemporal choice anomalies are highly correlated with indices of complexity responses including cognitive uncertainty and choice inconsistency. We show that model misspecification resulting from ignoring behavioral responses to complexity severely inflates structural estimates of present bias.


Female participants who interacted with a female chatbot gave the lowest ratings for goodwill and likeability among all groups

Gender identity and influence in human-machine Communication: A mixed-methods exploration. Weizi Liu, Mike Yao. Computers in Human Behavior, March 20 2023, 107750. https://doi.org/10.1016/j.chb.2023.107750

Abstract: The advancement of conversational technologies stimulates new research agenda on the patterns, norms, and social impacts of human-machine communication (HMC) as a novel process. Conversational agents (CAs), a prevalent example of machines that communicate with users directly, are usually depicted as females in assisting roles. This study intends to explore empirical evidence of how “gendered” technologies might influence HMC and potentially reinforce gender stereotyping in human-human communication. We applied a mixed-methods approach to explore users' gender-related responses and evaluations in the interaction with CAs. First, we observed unrestricted interactions between 36 human participants and Amazon Alexa in a laboratory and qualitatively analyzed the transcripts to detect gendered communication cues. We then conducted a 2 × 3 (participant gender: female vs. male; CA gender: female vs. male vs. neutral) online experiment where 250 participants interacted with a customized chatbot created by the researcher. Results showed participants’ different emotions/tones, engagement, (non)accommodation, as well as credibility, attraction, and likeability evaluations between human-CA gender pairs.


Expressions of Pain, Pleasure, and Fear Are Consistently Rated Due to Chance

“Eye can’t see the difference”: Facial Expressions of Pain, Pleasure, and Fear Are Consistently Rated Due to Chance. Silvia Boschetti, Hermann Prossinger, Tomáš Hladký, Kamila Machová, Jakub Binter. Human Ethology, Volume 37, 046-072, Nov 26, 2022. https://doi.org/10.22330/he/37/046-072

Abstract: Our research consisted of two studies focusing on the probability of humans being able to perceive the difference between faces expressing pain versus pleasure. As controls, we included: smile, neutral facial expression, and expression of fear. The first study was conducted online and used a large sample (n = 902) of respondents. The second study was conducted in a laboratory setting and involved a stress induction procedure. For both, the task was to categorize whether the facial expression was rated positive, neutral or negative. Stimuli were faces extracted from freely downloadable online videos. Each rating participant (rater) was presented with five facial expressions (stimuli) of five females and of five males. All raters were presented with the stimuli twice so as to evaluate the consistency of the ratings. Beforehand, we tested for stimuli differences using specialized software and found decisive differences. Using a Bayesian statistical approach, we could test for consistencies and due-to-chance probabilities. The results support the prediction that the results are not repeatable but are solely due to chance, decreasing the communication value of the expressions of pain and pleasure. The expression of fear was also rated due to chance, but neither neutral nor smile. Stress induction did have an impact on the perception of pleasure.


Keywords: perception, emotion, facial expression, visual stimuli, BDSM, pain and pleasure, Dirichlet distribution, Bayesian statistical approach, Cold Pressor Task


Rolf Degen summarizing... There is a fine line between neuroticism and high sensitivity, and the self-diagnosis of high sensitivity brings considerable redemption

On the feeling of being different–an interview study with people who define themselves as highly sensitive. Marcus Roth, Danièle A. Gubler, Tobias Janelt, Banous Kolioutsis, Stefan J. Troche. PLoS ONE, March 17, 2023. https://doi.org/10.1371/journal.pone.0283311

Abstract: The construct of “sensory processing sensitivity” has become an extremely popular concept outside the scientific literature under the term “high sensitivity” (HS), reflected in a variety of self-help guides and media reports. Therefore, the present study aimed to investigate this phenomenon by examining in-depth individuals who consider the label HS essential to their self-definition. In semi-structured interviews, 38 individuals described their understanding of HS and its perceived manifestations and impact on their lives (among other topics). Subsequently, the data were content-analytically evaluated, i.e., categorized and quantified. One key finding was that HS individuals feel relief following self-attribution or self-diagnosis. Moreover, this self-attribution replaced the feeling of being somehow different from the others, which almost all interviewees mentioned, with positive attributes. The main negative features of HS mentioned were feeling overwhelmed by sensory and emotional stimuli. The results are discussed with regard to the significance of the label HS for this group on the one hand, and with regard to alternative approaches for future research on the other hand.

Discussion

As described in the introduction, the construct known variously as sensory processing sensitivity (SPS) or high sensitivity (HS) has gained enormous importance in everyday psychology, going far beyond the construct’s scientific foundation. As the abundance of popular scientific literature and self-help guides shows, many people identify with and feel that they fall under the category of SPS. Therefore, the present study sought to find out how these people define HS, what manifestations they perceive, and what impact HS has on their lives. For this purpose, we conducted interviews with individuals who strongly define themselves as highly sensitive. Of course, it can be assumed that the definition of HSP and the self-perception of individuals is strongly influenced by social media and popular scientific works. Therefore, it is not surprising that the definitional elements we found are very much reflective of the popular scientific literature [see e.g., 7, 32, 40].

Summarizing the interview statements, the following picture emerges: People see the main characteristic of HS as increased and more intensive perception of emotional and sensory stimuli as well as longer processing of these stimuli. In addition, many subjects describe that they have stronger emotional empathy and are better able to recognize the perspectives of others. This is seen as positive by many, whereas the resulting feeling of exhaustibility and overstimulation is evaluated as negative by most. These data correspond with previous empirical findings that showed that HS is associated with global symptom load [18, 19], stress [41, 42], and anxiety [43]. Overall, as stated also in the scientific literature cited above, the feeling of being overwhelmed is essential to defining HS [e.g., 5, 17].

Despite these sometimes stressful experiences, almost all of the participants interviewed reported predominantly positive feelings when they first heard about HS. For many, this amounted to an attestation of “being normal”. Many saw themselves as part of a larger community and no longer as outsiders. The feeling of being somehow different from others, which almost all interviewees mentioned, was replaced with positive attributes. Thus, identification with HS can be described as "liberation" from the feeling of being deficient for most participants in this study. Correspondingly, a majority reported greater self-acceptance, especially since many explicitly described HS as a special ability. One participant summarized this connection in a particularly impressive way: “So, if you have always this stamp on your forehead that you are different, then it is very nice to hear that there is a cause for it–that it is not a disorder, but actually a special ability.”

At this point, we would like to make a first attempt to relate this pattern of results to the lack of separability between HS and neuroticism [e.g., 6, 18, 31]: Neuroticism is commonly evaluated negatively, as seen when individuals are asked to report their personality traits under “faking good” instructions [e.g., 44, 45]. This is likely reinforced by terms such as "emotional lability". Here, only the negative side of high neuroticism is considered, while positive features related to increased emotionality are not included—neither in the description of this personality trait nor in the items measuring it. In contrast to traits like “neurotic” or “introverted”, the term “highly sensitive” appears to be positively connotated. Furthermore, this term not only describes deficits, but also includes strengths of high neuroticism. In this way, the concept of HS might be a (quite desirable) way to free neuroticism from its purely deficit-based characterization. As shown by our results, HS people described suffering as a result of the pathologization of their emotionality and therefore experienced the label “highly sensitive” as liberating. In principle, a neutral label for a basic personality trait seems necessary. However, the problem with HS could be that the same mistake, namely judgmental labelling, is now made in the reverse direction: HS is posited as a positive trait by the flower metaphor [1, 29, 46], for example, according to which people are divided into “dandelions” (i.e. low sensitivity), “tulips” (medium sensitivity), and “orchids” (i.e. high sensitivity). Here, it seems useful to find a middle ground in terminology–something between “disturbed neurotics” and “the elected few of the human race” (to put it in rather pointed terms).

Interestingly, a recent study was able to demonstrate links between SPS and both vulnerable and grandiose narcissism [47].

In addition to highlighting people’s need to receive a neutral or positive description of their personality in order to be able to accept themselves, the present study can also advance scientific research. Of course, it remains possible that SPS actually exists as a trait but has so far been insufficiently conceptualized and measured. As mentioned above, it is currently difficult to distinguish HS from neuroticism, introversion and openness. Undoubtedly, one reason for this is the HSPS, which contains a large number of items measuring neuroticism, extraversion, and openness. However, this should not be surprising given how the HSPS items were generated. To extract the basic characteristics of HS people, Aron and Aron [5] asked students from university psychology classes to interview “‘highly sensitive people’—that is, those who are ‘either highly introverted (for example, preferring the company of one or two people) or easily overwhelmed by stimulation (such as noisy places or evocative or shocking entertainment)”. When manifestations of introversion and neuroticism are used as inclusion criteria, it is not surprising that items measuring introversion and neuroticism emerge as a result. It is possible that the “wrong people” were interviewed through this procedure. In contrast, the present study takes a more neutral approach and could serve as a start point for the development of an alternative scale with items that do not measure neuroticism and introversion, but refer primarily to what is specific to HSP.

Nevertheless, the biased sample characteristics can be viewed as limitations of the present study: The vast majority of participants were female and highly educated. These tendencies may not be uncommon in psychological studies, but were especially strong in the current study. However, this is not really surprising due to the recruitment procedure. Furthermore, although N = 38 is considerable for a qualitative sample, this sample size lacks representativeness and therefore must be viewed critically when it comes to generalizability. However, the consistent pattern of our results allows us to assume that the present study’s findings do allow a certain degree of generalization. Of course, such a generalization can only be valid for the German cultural area. Since this is the first study that explores people who define themselves as highly sensitive, information on cultural differences is unfortunately not available. However, the specific ways in which HS manifests “as a blessing and a burden” [47] in different cultures should be an interesting question for future research.


Tuesday, March 21, 2023

Adult men view criminal records as less of a hindrance to partner selection than adult women

She’s Not That into You: Speed Dating with a Criminal Record. Douglas N. Evans & Noreen Ali. Corrections, Mar 13 2023. https://doi.org/10.1080/23774657.2023.2190550

Abstract: Prosocial relationships are beneficial to post-conviction reintegration, but criminal stigma may limit romantic relationship access. This study implements an experimental audit of speed dating, which allows people to meet several potential partners in a brief time, to explore how conviction disclosure, offense type, and attractiveness and personality ratings affect dating interest. Three women and three men confederates of different races/ethnicities were randomly assigned to a control or one of three offense conditions before interacting one-on-one with 64 participants in 4-minute Zoom Q&A speed dating sessions. Following each interaction, participants rated one another on attractiveness, personality dimensions, and interest in dating. Findings indicate that disclosure of property offense conviction significantly reduced women’s willingness to date men confederates while assault and drug convictions did not negatively affect women’s dating interest. Women confederate disclosures of convictions did not affect men’s interest in dating them. Researching the effects of prior convictions on romantic relationship interest is challenging but important in revealing how criminal stigma varies by offense type to affect relationship capital.


Keywords: Speed dating, criminal history disclosure, race, attractiveness, stigma


Worth the Risk? Greater Acceptance of Instrumental Harm Befalling Men than Women

Worth the Risk? Greater Acceptance of Instrumental Harm Befalling Men than Women. Maja Graso, Tania Reynolds & Karl Aquino. Archives of Sexual Behavior, March 17 2023. https://link.springer.com/article/10.1007/s10508-023-02571-0

Abstract: Scientific and organizational interventions often involve trade-offs whereby they benefit some but entail costs to others (i.e., instrumental harm; IH). We hypothesized that the gender of the persons incurring those costs would influence intervention endorsement, such that people would more readily support interventions inflicting IH onto men than onto women. We also hypothesized that women would exhibit greater asymmetries in their acceptance of IH to men versus women. Three experimental studies (two pre-registered) tested these hypotheses. Studies 1 and 2 granted support for these predictions using a variety of interventions and contexts. Study 3 tested a possible boundary condition of these asymmetries using contexts in which women have traditionally been expected to sacrifice more than men: caring for infants, children, the elderly, and the ill. Even in these traditionally female contexts, participants still more readily accepted IH to men than women. Findings indicate people (especially women) are less willing to accept instrumental harm befalling women (vs. men). We discuss the theoretical and practical implications and limitations of our findings.

General Discussion

The current investigation sought to examine whether people were more willing to endorse interventions when IH was borne by men than women. Our first two studies supported this premise. Importantly, however, our results showed that this asymmetry was driven primarily by women, but not men, being more likely to accept IH to men than to women across a variety of contexts (i.e., supporting Hypothesis 2). Study 3 tested a boundary condition to this gender bias in harm tolerance: stereotypically female caregiving contexts. When instrumental harm benefitted vulnerable individuals (e.g., infants, young children, sick, or the elderly), both men and women exhibited a bias in their willingness to accept IH to men versus women (i.e., supporting Hypothesis 1; not supporting Hypothesis 3). That is, contrary to what might be expected by historical gender roles (Eagly & Wood, 1999), people believed men ought to bear greater costs, even in traditionally female sacrificial domains.

Theoretical and Practical Implications

Our findings offer four contributions. First, we extended the literature on gender and harm endorsement, which has primarily emphasized high-conflict sacrificial dilemmas involving questions of life or death (e.g., FeldmanHall et al., 2016; Skulmowski et al., 2014). The current findings revealed this gender bias persists in highly consequential, yet understudied domains: assessments of beneficial interventions carrying negative externalities across a variety of contexts: medical, psychological, educational, sexual, and caregiving. Second, we demonstrated that when evaluating interventions, female participants were more likely than male participants to accept IH borne by men than women. This pattern lends further support to the well-documented finding that women have a stronger in-group bias than men (e.g., Glick et al., 2004; Rudman & Goodwin, 2004) and are more likely to perceive one another as victims than perpetrators (Reynolds et al., 2020). This disparity suggests women may prioritize one another’s welfare over men’s in the construction or approval of social, educational, medical, and occupational interventions. If so, female policymakers might be especially wary of advancing policies or initiatives risking harm to other women, but less so when they risk harming men.

Third, we tested a boundary condition to this gender bias by investigating contexts previously unstudied in sacrificial dilemmas: stereotypically female caregiving roles. Although consideration of gender stereotypes and role congruence (Eagly & Wood, 1999) might predict a greater tolerance for female sacrifice in such contexts, men and women alike were more tolerant of IH incurred by men (versus women). These patterns suggest that although women traditionally fill and sacrifice in these roles, people may not necessarily endorse that ought to be the case. Rather, our results align with emerging evidence documenting diminished concern for men’s suffering due to a greater tendency to stereotype men as perpetrators rather than victims (Reynolds et al., 2020).

Fourth, our findings identified individual-level factors that contribute to asymmetries in harm tolerance. Namely, Studies 2 and 3 revealed that individuals more strongly endorsing egalitarian, feminist, or liberal ideologies exhibited greater disparities in their acceptance of instrumental harm, such that they more readily tolerated instrumental harm borne by men. These patterns suggest those most concerned about rectifying historical injustices might most ardently oppose exploratory interventions potentially providing long-term benefits to women.

Limitations, Emerging Questions, and Future Directions

Although the current investigation has its strengths (e.g., consistent results across varied contexts, within and between-person designs, diverse beneficiaries, pre-registrations), it is not without limitations. First, future investigations might profit, for example, from examining contexts that explicitly signal one’s willingness to sacrifice on behalf of others (e.g., voluntary military service or blood donation) to determine the generalizability of these patterns. Second, our conclusions are limited by our reliance on American MTurk and CloudResearch users. Thus, our results might not generalize to other contexts and cultures. Indeed, changes in stereotypes over time (Charlesworth & Banaji, 2022), and cultural differences in norms surrounding masculinity and femininity might shift beliefs about the value of IH incurred by men versus women (see Glick et al., 2004 for a cross-cultural comparison of attitudes toward men and women). Examining whether the reluctance to expose women to instrumental harm emerges across cultures remains an open avenue for future work. Moreover, our data were collected during the earlier days of COVID-19, which could have influenced the composition or motivations of our samples (Arechar & Rand, 2021). Thus, replication is warranted before strong conclusions can be inferred.

Fourth, although the results of Studies 1 and 2 consistently revealed women’s gender bias in instrumental harm acceptance, their methods could not disentangle whether the bias more strongly emerged from an aversion toward harming women or a desire to benefit women. That is, because both studies pit harm to one sex against the benefit to the other, it is unclear which more strongly contributed to these findings. That Study 3’s female participants (along with male) more readily tolerated men’s (versus women’s) suffering in contexts benefitting vulnerable individuals (rather than women) suggests the possibility Studies 1 and 2’s results reflected women’s greater aversion to harming fellow women, rather than a motivation to benefit them per se. Nonetheless, future research might examine interventions whereby only one sex is benefitted or harmed to adjudicate the relative contribution of these two factors.

Altogether, our findings point to potentially consequential implications for laypeople’s perceptions of exploratory interventions and programs. The asymmetry we documented may place disparate pressures on researchers and policymakers to intervene experimentally on men’s versus women’s afflictions in ways that minimize instrumental harm to women. The biases uncovered here suggest the possibility that women were excluded historically from exploratory research due to an aversion toward inflicting instrumental harm onto women, such as in medicine (Holdcroft, 2007). This ultimately proved costly to women, as men’s overrepresentation in medical research yielded treatments more effective among men than women (Holdcroft, 2007). Thus, although such an aversion may have benefitted women in the short term because women were spared incidental harm imposed by risky experiments, in the long run, experimentation on men unearthed medical and safety advancements better suited for male bodies. Experimental examinations and interventions carry both costs and benefits. If, as our results suggest, people are less willing to accept instrumental harm befalling women, women might lose out on the long-term benefits of such experimental endeavors.

Throughout history, countless male lives have been sacrificed on the battlefield, ostensibly to promote the greater good (Baumeister, 2010). Our findings suggest that these sentiments persist beyond the field of combat. For many people, accepting instrumental harm to men is perceived as worth the cost to advance other social aims. We invite researchers to further investigate how individuals appraise the value of suffering and whether those appraisals differ across target characteristics. A deeper understanding of the biases embedded in such calculations may minimize the unforeseen and unintended consequences of those preferences, thereby reducing harm to men and women alike.

We found a significant increase in pseudo-event coverage, expressing a more positive tone than genuine event coverage; moreover, political pseudo-event coverage shows quadrennial cycles with peaks in each presidential election year

Pseudo-events: Tracking mediatization with machine learning over 40 years. Mengyao Xu, Lingshu Hu, Amanda Hinnant. Computers in Human Behavior, Volume 144, July 2023, 107735. https://doi.org/10.1016/j.chb.2023.107735

Abstract: Using automated content analysis, this research explores the phenomenon of pseudo-events coverage in The New York Times (N = 70,370 articles) from 1980 to 2019. By clarifying the operationalization of pseudo-events, this study introduces pseudo-events as a valuable tool to index how different social subsystems perpetuate mediatization (which is when institutions absorb and abide by media logic). Machine-learning classifiers were constructed to measure pseudo-events, which provides historicity, specificity, and measurability — three tasks set forth for new mediatization research. We found a significant increase in pseudo-event coverage, expressing a more positive tone than genuine event coverage. Moreover, political pseudo-event coverage shows quadrennial cycles with peaks in each presidential election year. Our findings reveal the expansion of mediatization since 1980 and show how media logic has been internalized in different ways by the social subsystems of politics, culture, and economics. Institutions and their social actors need efficient tools to abide by media logic in seeking publicity and commanding authority, and pseudo-events have matured into one of the most dominant tools, especially for political actors. This study offers an innovative approach to capture complex phenomena and shows promises of broader application of machine learning to empirically quantify and identify patterns using theoretical concepts.


Monday, March 20, 2023

Acquiring knowledge by Googling gives people a greater illusion of understanding than passively reading the same information, especially when the search results feature snippets

Understanding Why Searching the Internet Inflates Confidence in Explanatory Ability. Emmaline Drew Eliseev, Elizabeth J. Marsh. Applied Cognitive Psychology, March 11 2023. https://doi.org/10.1002/acp.4058

Abstract: People rely on the internet for easy access to information, setting up potential confusion about the boundaries between an individual's knowledge and the information they find online. Across four experiments, we replicated and extended past work showing that online searching inflates people's confidence in their knowledge. Participants who searched the internet for explanations rated their explanatory ability higher than participants who read but did not search for the same explanations. Two experiments showed that extraneous web page content (pictures) does not drive this effect. The last experiment modeled how search engines yield results; participants saw (but did not search for) a list of hits, which included “snippets” that previewed web page content, before reading the explanations. Participants in this condition were as confident as participants who searched online. Previewing hits primes to-be-read content, in a modern-day equivalent of Titchener's (1921) example of a brief glance eliciting false feelings of familiarity.


Despite the wide educational use of Milgram’s studies to increase people’s awareness of the risks inherent to blind obedience, it may be that this knowledge only serves to evaluate other’s behaviors, and not oneself

The blind obedience of others: a better than average effect in a Milgram-like experiment. Laurent Bègue & Kevin Vezirian. Ethics & Behavior, Mar 15 2023. https://doi.org/10.1080/10508422.2023.2191322

Abstract: In two highly powered studies (total N = 1617), we showed that individuals estimated that they would stop earlier than others in a Milgram-like biomedical task leading to the death of an animal, confirming the relevance of the Better than Average Effect (BTAE) in a new research setting. However, this effect was not magnified among participants displaying high self-esteem. We also showed that participants who already knew obedience studies expected that others would be more obedient and would administer more damaging treatment to the target. However, knowledge of Milgram’s studies was unrelated to a higher estimate of their own behavior (study 1), and was even linked to the prediction that they would stop earlier (study 2, preregistered). Despite the wide educational use of Milgram’s studies to increase people’s awareness of the risks inherent to blind obedience, it may be that this knowledge only serves to evaluate other’s behaviors, and not oneself.


Sunday, March 19, 2023

The worn-out idea of "stereotype threat" suffers another defeat in a failed replication, playing no role in women's lower level of political knowledge

Does Stereotype Threat Contribute to the Political Knowledge Gender Gap? A Preregistered Replication Study of Ihme and Tausendpfund (2018). Flavio Azevedo, Leticia Micheli, Deliah Sarah Bolesta. Journal of Experimental Political Science, March 16 2023. https://doi.org/10.1017/XPS.2022.35

Abstract: The gender gap in political knowledge is a well-established finding in Political Science. One explanation for gender differences in political knowledge is the activation of negative stereotypes about women. As part of the Systematizing Confidence in Open Research and Evidence (SCORE) program, we conducted a two-stage preregistered and high-powered direct replication of Study 2 of Ihme and Tausendpfund (2018). While we successfully replicated the gender gap in political knowledge – such that male participants performed better than female participants – both the first (N = 671) and second stage (N = 831) of the replication of the stereotype activation effect were unsuccessful. Taken together (pooled N = 1,502), results indicate evidence of absence of the effect of stereotype activation on gender differences in political knowledge. We discuss potential explanations for these findings and put forward evidence that the gender gap in political knowledge might be an artifact of how knowledge is measured.

Discussion

Ihme and Tausendpfund (2018) have proposed that the activation of negative gender stereotypes accounts for the variance of the political knowledge gender gap. In our independent and well-powered direct replication, we find no evidence that activation of gender stereotypes affects participants’ performance in a political knowledge test. Indeed, we find evidence of absence of this effect.

We note that some elements of our study design diverged from the original study and could have contributed to the observed non-replication. Our study was conducted with American students and working adults, whereas the original study included German students. As the United States has achieved relatively lower gender parity than Germany in political empowerment (World Economic Forum 2021), one could argue that negative stereotypes about women might be more salient for Americans than Germans, undermining women’s cognitive performance even in the absence of stereotype activation (e.g., in the control condition). Although we cannot rule out that some populations might be more vulnerable to gender stereotyping than others, we have reduced cultural biases as much as possible by devising a political knowledge test that was – at the same time – similar to the one used in the original study regarding the level of difficulty, as our data suggest, and relevant to the American political context. A comparison of the effect of stereotype threat on gender differences in political knowledge across countries with varying levels of gender equality would be beneficial for a better understanding of potential cultural differences in stereotype threat. Second, as a direct consequence of including working adults in our sample, it was necessary to adapt the measure of field of study to encompass the field of work. We argue, however, that this should not have contributed to the unsuccessful replication. If our measure of field of study/work would inadvertently make participants aware of their affiliation with a Politics or Non-Politics group, the effects of gender stereotype activation on performance would presumably become more salient. Instead, our results show that the field of study/work did not influence the results (Tables S16 and S17). An argument can be made, however, that the extensive list of topics in our study reduced participants’ self-identity with Politics.
Nevertheless, adding participants’ attributed importance of Politics to their study/work as a covariate in the analyses did not change results (Tables S18 and S19). We have also conducted further tests restricting our sample to young and educated adults to achieve a sample more similar in composition to the respondents in the original study, but we could still not replicate the effect of stereotype activation on the gender gap in political knowledge (Table S20).

We note that our failure to replicate the effect of stereotype threat on gender differences in political knowledge is consistent with recent research efforts challenging the effect of stereotype threat on academic performance more broadly. Stoet and Geary (2012) showed that only 30% of efforts aiming to replicate the gender gap in mathematical performance do succeed. In addition, a meta-analysis investigating the effect of gender stereotype threats on the performance of schoolgirls in stereotyped subjects (e.g., science, math) indicated several signs of publication bias within this literature (Flore and Wicherts 2015). Given these results, it is plausible that the effect of gender stereotype activation might be small in magnitude and/or might be decreasing over time (Lewis and Michalak 2019).

Furthermore, we find robust evidence of a gender gap in political knowledge even after controlling for political interest. Our results validate previous accounts that the gender gap on political knowledge may be an artifact of how knowledge is conceptualized and measured and of different gender attitudes toward standard tests. In line with previous research stating that the political knowledge gap might be artificially inflated by a disproportionate amount of men who are willing to guess rather than choose the “don’t know” option – even if that might lead to an incorrect answer (Mondak and Anderson 2004) – we find that female participants attempted to answer fewer questions and used the “don’t know” response option in the political knowledge test more frequently than their male counterparts whereas men guessed their answers more frequently than women, resulting in a larger amount of incorrect answers (Tables S8S14). This suggests factors other than knowledge might contribute to the gender gap in political knowledge (Mondak 1999). For example, gender differences in risk taking and competitiveness (Lizotte and Sidman 2009) as well as in self-confidence (Wolak 2020) and self-efficacy (Preece 2016) may lead women to second-guess themselves and be less prone to attempt answering the questions of which they are unsure. Meanwhile, higher competitiveness and confidence in males might lead them to guess and “gain the advantage from a scoring system that does not penalize wrong answers and rewards right ones” (Kenski and Jamieson 2000, 84). Measurement non-invariance, too, appears to detrimentally affect the interpretation and validity of political knowledge scales across several sociodemographics.
For example, Lizotte and Sidman (2009) and Mondak and Anderson (2004) have shown political knowledge instruments violate the equivalence assumption for gender, while Abrajano (2015) and Pietryka and MacIntosh (2013) found non-invariance across age, income, race, and education. In our own replication attempt, we also found evidence of measurement non-invariance using item response theory and showed that the magnitude of the gender systematic bias appears to be contingent on respondents’ knowledge levels such that lack of equivalence by gender is stronger at average scores and weaker at the extremes of the political knowledge continuum (see Table S21 and Figure S1).

As Politics has been essentially a male-dominated field since its creation, it should not come as a surprise that current measures of political knowledge tend to favor what men typically know. Previous studies have shown that the mere inclusion of gendered items on scales of political knowledge lessens the gender gap (Barabas, Jerit, Pollock, and Rainey 2014; Dolan 2011). The investigation and validation of measures of political knowledge that capitalize on the fact that men and women might not only know different things but also may react in different ways to standard tests is paramount for a more accurate understanding of the gender gap in political knowledge and its bias.

Finally, we note that measurement issues are not unique to political knowledge and in fact are pervasive in Political Science with consequences for how we measure populism (Van Hauwaert, Schimpf, and Azevedo 2018, 2020; Wuttke, Schimpf, and Schoen 2020), operational ideology (Azevedo and Bolesta 2022; Azevedo, Jost, Rothmund, and Sterling 2019; Kalmoe 2020), and political psychological constructs such as authoritarianism, racial resentment, personality traits, and moral traditionalism (Azevedo and Jost 2021; Bromme, Rothmund, and Azevedo 2022; Pérez and Hetherington 2014; Pietryka and MacIntosh 2022). If the basic measurement properties of widely used constructs are flawed, it is likely that insights from research will be biased. Valid, invariant, and theoretically derived instruments are urgently needed for the reliable accumulation of knowledge in Political Science.

Both left and right agree that it's the bad, divisive stuff that goes viral on social media that least deserves it, while the good stuff is not amplified as it should be

Rathje, Steve, Claire Robertson, William J. Brady, and Jay J. Van Bavel. 2022. “People Think That Social Media Platforms Do (but Should Not) Amplify Divisive Content.” PsyArXiv. October 11. doi:10.31234/osf.io/gmun4

Abstract: There is widespread debate about how to improve or regulate social media algorithms. We review the type of content that is most likely to spread widely, or go “viral” on social media, and describe how people’s perceptions of what goes viral do not match their preference about what should go viral. We recruited a nationally representative sample of US participants and surveyed them about their perceptions of social media virality (n = 511). In line with prior research, people believe that divisive content, moral outrage, negative content, high-arousal content, and misinformation are all likely to go viral online. However, people reported that this type of content should not go viral on social media. Instead, people reported that many forms of positive content – such as accurate content, nuanced content, and educational content – are not likely to go viral, even though they think this content should go viral. Importantly, these perceptions were widely shared, and were only weakly related to political orientation, social media usage, and demographic variables. In sum, there is broad consensus around the type of content people think social media platforms should and should not amplify, which can help inform solutions for improving social media.