Had I Been a Reviewer. Actually, I was a reviewer. Huh! So, how did this turn out?
McNulty, Maxwell, Meltzer, & Baumeister make use of two cohorts of newlyweds to find out whether discrepancies in sexual desire contribute to reducing marital satisfaction. The study is based on the same data as McNulty et al. 2016 and tries to answer a very similar question. The only difference is that here we are looking at sexual desire (which was measured less often) rather than sexual satisfaction and frequency as the predictor, and there’s an added consideration of childbirth and stress as mediating factors. I reviewed a previous version of this manuscript at another journal, so this blog post is an edited version of that review minus the points the authors addressed1.
In all, I think the results are plausible and the data are rich, but I wanted to think through some important limitations of their data (some of them discussed at length, some less so) to figure out how I think about the results.
One key limitation is not addressed by design. Many couples never marry, and most couples don’t marry right after meeting. We may be looking at a very heterogeneous group here with respect to relationship duration. Why is that important? For example, couples who marry after having been together for longer may be more likely to have children. We would then spuriously conclude effects of childbirth that are in reality driven by pre-existing differences in relationship duration. The authors did not share data on relationship duration preceding marriage. Their data on newlyweds are obviously valuable, but for answering the specific research question posed, I’d hazard starting with unmarried couples would give us clearer answers.
At the end, the authors say that hormonal fluctuations may be proximal mechanisms by which desire changes.
Yet, they do not discuss menstrual cycles, hormonal contraception, pregnancy and breastfeeding. All of these entail hormonal changes. Let’s think this through: If newlywed women are more likely to be already pregnant at the first timepoint, changes in sexual desire after birth might rather reflect a shift from pregnancy back to breastfeeding and/or regularly ovulating or hormonal contraception.2
Hormonal contraception causes small decreases in sexual desire on average. If newlywed women are more likely to have gone off the pill, they may temporarily have higher sexual desire in the first wave, and then decrease again after a return to hormonal contraception after birth.
More mechanistically, given that we found that women experiences peaks in sexual desire before ovulation, it may have been more likely that women were asssessed around ovulation during the first wave and less likely after childbirth (given that both breastfeeding and combined hormonal contraceptives can suppress ovulation).
Finally, with the age range of their sample, I don’t think anyone entered menopause during the study period, but I still would have liked to see their analyses adjusted for age.
Are the effects reported by the authors small enough to be fully explained by these slightly roundabout explanations? In standard deviations, the effect is approximately a decrease of 0.18 across both studies, if I understood their table correctly. I don’t know how many women change their contraceptive method after marriage in their sample, nor do they report when and how many couples had children, so there are a lot of unknowns here.
Combined with the problem that relationships don’t start at marriage, I do wonder if we were really shown evidence of a linear decline in sexual desire or whether we just started following couples right after a small uptick in sexual desire after the honeymoon (for hormonal and many other plausible reasons). I think getting this right makes a real difference to the counseling of couples.
I don’t really know what the right approach would be here, but certainly the authors should have mentioned how they dealt with missing data (FIML? Listwise deletion?) and I don’t think adjusting for the number of waves does much good. Presumably, dissatisfied couples are more likely to drop out, because they divorce or separate, so a censored model could be appropriate for the marital dissatisfaction analyses. In all, I think these problems could lead to an underestimation of effects, but I could be wrong about this because I don’t know the details of the model the authors fit.
In all, the conclusions were tempered a lot between the version I saw and this one; in fact, I feel like I can see the signature of certain well-known reviewers in the limitations section. But these two slipped through:
quality close relationships are a significant source of mental and physical health
This is an unsupported causal claim. The correlation may well be due to reverse causation or unobserved third variables. It would have be supported by something stronger than meta-analyses of correlations (e.g., propensity score matching). I’d add that divorce does not imply that people will necessarily end up lonely; they may end up happier. There’s still friends, new partners, or remarriage. To make the claim you’re trying to make, it would be more apt to cite negative consequences of divorce for well-being and mortality, rather than show positive correlates of marriage and assume that those whose marriages end will be alone.
desire for sex dwindles among newly married women but not men
Given the small effect size, the word “dwindles” is misleading.
Given that the last two papers I did HIBARs on provided open data and one provided open analysis scripts, I have to point out that I really felt the lack of reproducibility for this paper. The data is not open and there is no syntax for the quite complex models. To my mind, this makes issues with things like missing data handling much more pressing, because I and others cannot easily re-analyse the data.
There is still at least one inconsistent p value in here (as identified by Statcheck.io), even after I recommended Statcheck in my review. I could not reproduce the authors measure of within-subject variability (presumably because I could not figure out how they went about it), although I could reproduce the directional result using my own approach.
There is no codebook for the two studies, so I have to rely on various descriptions of the data strewn across multiple papers to find out what they did and did not measure.
They report Cronbach’s α, but no retest reliabilities, nor reliabilities of change3; both of which of would be relevant to the question of whether they had adequate power. My admonition that they needed to provide more detail for their power analysis (e.g., predicted effect sizes) for it to make sense was apparently dealt with by omitting the power analysis entirely.
Figure 1 just shows simple means with standard errors. A spaghetti plot (showing trajectories for all couples) or a smoothed spline over time superimposed on raw data would have done justice to the data.
Further, given the birth of children is consistently linked to marital satisfaction, a potential confound, we controlled marital satisfaction in these analyses as a time-varying covariate.
Marital satisfaction is not just a potential confound, it’s also a potential outcome of lower sexual desire. Adjusting for it is hence not straightforward in a timeseries with long lags like this. I would like to see the results without this adjustment.
The authors use the infamous terms “marginally significant” and “trended toward significance”. As many others have pointed out, p values aren’t Geiger counters that tell you when you’re approaching the truth.
While I found the paper interesting in general, I took less away from this than I could have. Of course, not everyone has a focus on hormones in their work, but given the limitations discussed above I cannot integrate their findings with what we already know about sexual desire and hormones. I hope future work revisits these issues.
for example, they pulled in the sexual frequency and satisfaction data here to link the two papers a bit more↩︎
The authors never discuss collecting information on whether women were pregnant when measured, although I assume they could back-calculate that from children’s birth dates (if those were collected) or approximately infer it depending on whether any children were born between waves (they definitely collected that).↩︎
Cronbach’s alpha is not a sufficient measure of reliability to report for longitudinal measures such as these. I recommend reporting multilevel generalizability (Shrout and Lane 2012), as implemented e.g. in the psych package by Revelle. Especially the coefficient reliability of change is interesting for the analyses reported. Also, simply reporting that coefficients were “more than .90” is too imprecise.↩︎
If you see mistakes or want to suggest changes, please create an issue on the source repository.
Text and figures are licensed under Creative Commons Attribution CC BY 4.0. Source code is available at https://github.com/rubenarslan/rubenarslan.github.io, unless otherwise noted. The figures that have been reused from other sources don't fall under this license and can be recognized by a note in their caption: "Figure from ...".
For attribution, please cite this work as
Arslan (2019, Sept. 3). One lives only to make blunders: HIBAR: Sex‑Differentiated Changes in Sexual Desire Predict Marital Dissatisfaction. Retrieved from https://rubenarslan.github.io/posts/2019-09-03-hibar-sexdifferentiated-changes-in-sexual-desire-predict-marital-dissatisfaction/
BibTeX citation
@misc{arslan2019hibar:, author = {Arslan, Ruben C.}, title = {One lives only to make blunders: HIBAR: Sex‑Differentiated Changes in Sexual Desire Predict Marital Dissatisfaction}, url = {https://rubenarslan.github.io/posts/2019-09-03-hibar-sexdifferentiated-changes-in-sexual-desire-predict-marital-dissatisfaction/}, year = {2019} }