My thanks to Valerio Capraro, the first author of the consensus statement, who gave me feedback on an earlier version of this post.
A large working group of psychologists recently put out a consensus statement on social media and mental health effects, based on Haidt’s book “The Anxious Generation”.
Many of the scholars I respect the most in this space (e.g., Lucy Foulkes, Amy Orben, Andrew Przybylski) seem to have not participated or dropped out of this effort. I know that in at least two cases this was because they did not trust the process. I am not an expert in this field myself, but I did read “The Anxious Generation”, “What Mental Illness Really Is … And What It Isn’t” and a few other things and have some familiarity with how strong the evidence base is. Now, personally, I think this consensus effort is premature. For many of the claims they examined we don’t have the evidence to assess many of these claims well. Consensus is not a shortcut to understanding the world.
In the end, the participants found consensus for only one causal claim (“Heavy smartphone and social media use can cause sleep problems.”), several weak claims about associations (e.g., “social media may be associated with visual social comparison in girls”) as well as a claim that adolescent mental health has declined in several Western countries. To me, not even the evidence for this mental health decline is very compelling. But generally this consensus is not far away from my own current understanding. What irks me is how this was communicated and what they chose to focus on.
It’s not your fault if recent social media posts led you to believe there was a “ton of consensus” that “Adolescent mental health has declined” or that rejecting this consensus is akin to denying climate change.
Screenshot of now deleted skeet by Jay van Bavel
However, even if I don’t think the consensus statement has much value, I think it’s meta-scientifically interesting to look at how the participating scholars assessed the level of evidence for each claim. Maybe this will also give us an idea what kind of experts self-selected into the process. Potentially, the Delphi method corrected for all problems. I am not trying to assess that. I am only looking at Study 1 here to understand “how the sausage gets made”. Similarly, I found it interesting to read how the first author dealt with critical comments, documented on OSF.
Naively, you might expect that sometimes, when people have expertise in one area but not another, they would not answer questions in those areas where they cannot assess the evidence base. You would not expect them to report a stronger evidence base than exists. Consider replicated experimental field studies. These are very effortful studies, done either at considerable expense or making use of widely reported policy changes. You certainly wouldn’t expect a researcher to say such studies exist in their field of expertise when they don’t. In fact, I fail my students when they make claims their citations cannot support (I’ve gotten stricter about this because it’s such an LLM hallmark). WELL!
You might be surprised to hear that, initially, for at least seven claims a substantial fraction of scholars said there were causally informative field studies yet none were cited in the final consensus statement’s evidence summary according to language model Gemini 2.5 Pro. The first author, Valerio Capraro, seems to agree that the evidence for most claims is weak and correlational. So, I don’t think my assessment that some scholars overestimated the evidence is out of whack with the first author’s views.
So, who are these scholars who think there have been replicated causally informative field studies showing that social media affects body image? Maybe they have references that didn’t make the cut. Unfortunately, I don’t know! Turns out the responses were anonymized, as is standard in the Delphi method. I.e. even for those scholars who participated non-anonymously, we don’t know what they said for each claim. Expertise without accountability? Truly, you can have it all.
What question did they answer exactly when assessing the evidence base for causal claims?
To the best of your knowledge, what is the level of evidence regarding this claim?
- ▢ No evidence, only opinions, perspectives, general theory or anecdotes (1)
- ▢ Some correlational evidence (laboratories, surveys, online, field) (2)
- ▢ Some causal evidence but in limited settings (laboratories, surveys, and online, self-reported measures) (3)
- ▢ Causal evidence in a field study (4)
- ▢ Replicated causal evidence from field studies (5)
- ▢ Wide-scale causal evidence from multiple field studies, policy evaluations or other natural settings (6)
What did they say in Study 1?

Figure caption for Figure 3
Figure 3. Summary of responses to the “level” question for each of the 26 claims from
Survey 1, ordered as in Figure 2. For each claim, the “level” question asked in Survey 1
was: “To the best of your knowledge, what is the level of evidence regarding this claim?”.
Available answers depended on the claim. For non-causal claims 1-7, 16, and 22-23, the
available answers were: No evidence, only opinions, perspectives, general theory or
anecdotes; Some empirical evidence but in limited settings (laboratories, surveys and online,
self-reported measures); Field evidence; Replicated evidence in field studies or other natural
settings; Wide-scale evidence from multiple field studies, policy evaluations or other natural
settings. For causal claims 8-15, 17-21, and 24-26, the available answers were: No evidence,
only opinions, perspectives, general theory or anecdotes; Some correlational evidence
(laboratories, surveys, online, field); Some causal evidence but in limited settings
(laboratories, surveys, and online, self-reported measures); Causal evidence in a field study;
Replicated causal evidence from field studies; Wide-scale causal evidence from multiple field
studies, policy evaluations or other natural settings. For visualization purposes, we
shortened the wording of the claims. Refer to Table 1 for the exact wording of the claims.
As far as I could tell at a glance, the authors didn’t precisely define what counts as causal vs. correlational. I’m one of those weirdos who believe you can infer causality from observational/correlational data if you’re careful, but from the contrast to “correlational”, I guess the authors considered only experiments and natural experiments “causal evidence”.
Now, after examining this graph, you would be excused if you thought these categories were mutually exclusive. But no:
Moreover, after careful consideration within the core group, we decided to allow multiple answers for this question to avoid forcing participants to rank the level of evidence.
Yes, you certainly wouldn’t want to rank levels of evidence. How would those poor correlational studies and opinions feel? Still, you might be confused, after all, Figure 3 (above) tabulates them as if responses summed to 100%. But actually authors could and did endorse multiple mutually exclusive answers. Most obviously, more than 70 responses say both there is “no evidence” and at the same time say there is evidence. I guess they exist in a state of superposition. It would collapse if we actually observed the evidence, but, like, who has the time?
value |
n() |
1,2 |
59 |
1,2,3 |
9 |
1,2,3,4,6 |
1 |
1,3 |
4 |
1,3,4 |
1 |
1,5 |
1 |
1,6 |
1 |
The paper didn’t document how the graph was made, so I asked) and it turns out they expanded the denominator. It sums to 100% because the denominator is “number of checkboxes clicked”. I don’t think that’s a useful denominator. People who checked no box are excluded and you can’t interpret the percentage as the number of experts who endorsed X.
I looked into this because several experts claimed that there were replicated causally informative field studies showing that social media increases visual social comparison in girls. That seemed odd, because a) this seems difficult to study experimentally in the field b) I looked at this literature briefly a while ago and didn’t encounter anything of the sort. So, I checked the evidence summary in Supplementary Section S5. Indeed, no field studies are highlighted at all. Weird. Did they imagine these studies? Do they exist, but for some reason the best-quality evidence (sorry, sorry, I know evidence shan’t be ranked) isn’t mentioned?
To find out how this stacks up for the other claims, I let Gemini 2.5 Pro take a stab at tabulating how often experts say studies exist that then seem to be missing in action in the evidence review. I only hand-checked the visual social comparison section. Gemini agreed with my assessment there. So, here we have it: It’s not just LLMs that occasionally hallucinate scientific papers.
Does this mean the consensus statement is wrong? They went through three more rounds and had a chance to rectify such problems with grounding. Still, I worry that people who admit uncertainty self-selected out and people who incorrectly believe causally informative field studies exist where there are none, self-selected in.
Claim |
% Causal Field (Est. from Fig 3) |
% Causal Field or Higher (Est. from Fig 3) |
S5 Text: Experimental Field, Natural Experiment, or Well-Controlled Design in Field Supporting Causal Claim? |
S5 Text: Observational Study Cited as Supporting Causal Claim? |
8. Heavy daily use of smartphones and social media can cause sleep deprivation. |
~10-12% |
~25-30% |
Yes (Intervention studies like Martin et al., Perrault et al., Tu et al., described as improving sleep by reducing screen time, could be field experiments). Schechter et al. (meta-analysis of blue light interventions) and Hartstein et al. (consensus panel review) also point to experimental/interventional evidence. |
Yes (Longitudinal studies like Rod et al., Liu et al. cited for associations). |
9. Chronic sleep deprivation can cause a decline in mental health. |
~5-7% |
~75-80% |
Yes (Scott et al. meta-analysis of sleep-improvement interventions showing mental health benefits is strong. Pires et al. also likely experimental). |
Yes (Liu et al. review of longitudinal studies). |
10. Heavy daily use of smartphones and social media can cause attention fragmentation. |
~2-3% |
~10-12% |
Potentially (van Wezel et al. intervention, though manipulation failed, was field-based. “Mere presence” studies like Ward et al. are lab). |
Yes (Longitudinal studies like Ra et al., Deng et al. cited for associations with ADHD symptoms). |
11. Attention fragmentation can cause a decline in mental health… |
~2-3% |
~5-7% |
Potentially (Intervention studies on disabling notifications like Fitz et al., Olson et al. are field-based and show some MH impact). |
Yes (Correlational studies on multitasking and MH like Becker et al., Shin et al.). |
12. Heavy daily use of smartphones and social media can cause behavioral addiction. |
~7-10% |
~20-25% |
Yes (Allcott et al. 2022 is explicitly cited as a field study suggesting habit formation and self-control problems). |
Yes (Truzoli et al. on distress post-internet; Anderson & Wood on blunted reward sensitivity). |
13. Behavioral addiction can cause a decline in mental health. |
~5-7% |
~50-55% |
No direct experimental field study cited for causing decline. The argument is often definitional or based on comorbidity. |
Yes (Hartmann & Blaszczynski longitudinal study on gambling; Petry et al., Ko et al. on comorbidity). |
14. Heavy daily use of smartphones and social media can cause social deprivation… |
~2-4% |
~10-15% |
Yes (Allcott et al. 2020 deactivating Facebook was an experimental field study. Geraci et al. 2022 quasi-experimental study on internet speed is a natural experiment). |
Yes (Meta-analyses by Appel et al., Liu & Baumeister cited, though Cheng et al. found no overall association). |
15. Chronic social deprivation can cause a decline in mental health. |
~10-12% |
~60-65% |
Yes (Killgore et al. 2020 on lockdown effects is a natural experiment. Intervention studies like Alvarez et al., Costello et al., Lai et al. on increasing social interaction improve MH and are likely field-based). |
Yes (Cacioppo et al. longitudinal study on isolation and depression). |
17. Social media increases visual social comparisons among adolescent girls. |
~2-3% |
~15-20% |
No (Cited experimental studies like Fardouly et al. 2015, Kleemans et al. 2018, Tiggemann & Anderberg 2020 are lab/online, not field studies measuring social comparison as an outcome in a consequential real-world setting). |
Yes (Couture Bue 2020 eye-tracking study; Fardouly et al. 2017 survey). |
18. Social media increases perfectionism among adolescent girls. |
~2-3% |
~5-8% |
No (Consensus states no experimental studies directly manipulating social media for this. Vanhoffelen et al. 2025 longitudinal study showed weak evidence). |
Yes (Surveys by Danielsen et al. 2024, Simon et al. 2022, Harren et al. 2021). |
19. Social media increases relational aggression among adolescent girls… |
~2-3% |
~5-7% |
No (Consensus states no experimental evidence for SM use -> increased RA. Martins & Weaver meta-analysis included experiments on “exposure to RA -> RA behavior,” not direct SM manipulation). |
Yes (Marinoni et al. 2023 on cyberbullying vulnerability). |
20. Among adolescent girls, social media increases exposure to other people displaying or discussing their mental disorders. |
~3-5% |
~15-20% |
No (Consensus states no direct causal link references. Cited studies are observational, e.g., phenomena on TikTok, Twitter disclosures). |
Yes (Alho et al. on peer networks; Hamilton et al. on self-harm content exposure). |
21. Social media increases sexual predation and harassment of adolescent girls… |
~1-2% |
~10-15% |
Potentially (Bhuller et al. 2013 quasi-experimental study on general internet access and sex crimes is a natural experiment, but not specific to SM mechanisms for predation). Consensus states no SM-specific experimental studies. |
Yes (Bratton et al. 2020 on Facebook use and online harassment). |
24. If most parents waited… it would benefit the mental health of adolescents overall. |
~2-3% |
~5-8% |
No (Cited studies are correlational/longitudinal. No experimental field studies manipulating delayed access). |
Yes (Dempsey et al., Vaterlaus et al., Gerosa et al., Sapien Labs, Sun 2023, all observational). |
25. Imposing (and enforcing) a legal minimum age of 16… would benefit the mental health of adolescents overall. |
~1-2% |
~5-7% |
No (Consensus states no strictly related references. Cited experimental work like Ferguson 2024 is on temporary abstention, not age limits). |
Yes (General research linking SM to decreased MH like Twenge & Campbell 2018 cited as rationale, but it’s not testing the policy). |
26. Phone-free schools would benefit the mental health of adolescents overall. |
~5-7% |
~10-15% |
Yes (Studies of school phone bans cited in Böttger & Zierer 2024 meta-analysis, King et al. 2024, Campbell et al. 2024 review, Goodyear et al. 2025 meta-analysis are natural experiments in field settings). |
No (The studies cited are evaluations of bans, not general observational studies of phone use in schools). |
Notes on Estimation from Figure 3:
The percentages for “% Causal Field” and “% Causal Field or Higher” are visual estimations from the provided stacked bar chart (Figure 3, p. 11 of the PDF). They are approximate due to the nature of reading from a graph.
My requests
I made a few requests and then had Gemini summarise. I put the final summary up top, the rest is in conversational order.
For each of the claims, there is a summary of the evidence in Supplementary Section 5. Look only at claims that make a causal claim. Then, note, for each claim, how many experts assessed the evidence as being causal, causal/field, replicated evidence, widespread causal evidence. Then, assess whether the studies cited in that section are causally informative, field studies or direct replications of one another. Note if there is a conflict with the expert assessment.
Start with the claim that “Social media increases visual social comparison in girls”. We’ll take the claims in turn.
You’re right, Figure S3 (which is Figure 3 in the main text on page 11, not Supplementary Section S4 as I previously misremembered) is a stacked bar chart where the segments do appear to represent mutually exclusive portions of the respondents who answered the “level” question for each claim, likely categorized by the highest level of evidence they endorsed.
My apologies, my previous interpretation was incorrect based on how these charts are typically constructed to represent a whole. The explicit mention of allowing multiple answers is for the data collection phase, but for visualization in a stacked bar chart aiming for 100%, a recoding to the highest level endorsed is standard.
Let’s re-examine based on this understanding and your new request to specifically look for experimental field studies, natural experiments in the field, or well-controlled designs cited as supporting the causal claim in Section S5, and to report the “Causal Field” percentage directly from Figure 3.
Revised Interpretation of Figure 3 (formerly S3):
The bars in Figure 3 sum to 100% for each claim (representing all experts who provided a “level of evidence” response for that claim). Each colored segment represents the proportion of those experts for whom that specific level was their highest endorsed level of evidence.
- The dark purple (“Wide-scale causal”) is the highest.
- The next blue (“Wide-scale”) would be for non-causal claims.
- Then “Replicated causal field,” “Replicated field,” “Causal field,” “Field,” “Causal limited,” “Correlational,” “Empirical,” and finally “None (e.g., opinion).”
- The “No response” category mentioned in the Figure S3 caption (page 63) is not visually part of the stacked bars in Figure 3 on page 11, meaning Figure 3 likely only includes those who did provide a level of evidence.
I will now go through the claims again, try to estimate the “Causal field” percentage from the visual in Figure 3 (page 11), and assess the S5 text for the types of evidence you’ve specified.
Revised Table
Claim |
% Causal Field (Est. from Fig 3) |
% Causal Field or Higher (Est. from Fig 3) |
S5 Text: Experimental Field, Natural Experiment, or Well-Controlled Design in Field Supporting Causal Claim? |
S5 Text: Observational Study Cited as Supporting Causal Claim? |
8. Heavy daily use of smartphones and social media can cause sleep deprivation. |
~10-12% |
~25-30% |
Yes (Intervention studies like Martin et al., Perrault et al., Tu et al., described as improving sleep by reducing screen time, could be field experiments). Schechter et al. (meta-analysis of blue light interventions) and Hartstein et al. (consensus panel review) also point to experimental/interventional evidence. |
Yes (Longitudinal studies like Rod et al., Liu et al. cited for associations). |
9. Chronic sleep deprivation can cause a decline in mental health. |
~5-7% |
~75-80% |
Yes (Scott et al. meta-analysis of sleep-improvement interventions showing mental health benefits is strong. Pires et al. also likely experimental). |
Yes (Liu et al. review of longitudinal studies). |
10. Heavy daily use of smartphones and social media can cause attention fragmentation. |
~2-3% |
~10-12% |
Potentially (van Wezel et al. intervention, though manipulation failed, was field-based. “Mere presence” studies like Ward et al. are lab). |
Yes (Longitudinal studies like Ra et al., Deng et al. cited for associations with ADHD symptoms). |
11. Attention fragmentation can cause a decline in mental health… |
~2-3% |
~5-7% |
Potentially (Intervention studies on disabling notifications like Fitz et al., Olson et al. are field-based and show some MH impact). |
Yes (Correlational studies on multitasking and MH like Becker et al., Shin et al.). |
12. Heavy daily use of smartphones and social media can cause behavioral addiction. |
~7-10% |
~20-25% |
Yes (Allcott et al. 2022 is explicitly cited as a field study suggesting habit formation and self-control problems). |
Yes (Truzoli et al. on distress post-internet; Anderson & Wood on blunted reward sensitivity). |
13. Behavioral addiction can cause a decline in mental health. |
~5-7% |
~50-55% |
No direct experimental field study cited for causing decline. The argument is often definitional or based on comorbidity. |
Yes (Hartmann & Blaszczynski longitudinal study on gambling; Petry et al., Ko et al. on comorbidity). |
14. Heavy daily use of smartphones and social media can cause social deprivation… |
~2-4% |
~10-15% |
Yes (Allcott et al. 2020 deactivating Facebook was an experimental field study. Geraci et al. 2022 quasi-experimental study on internet speed is a natural experiment). |
Yes (Meta-analyses by Appel et al., Liu & Baumeister cited, though Cheng et al. found no overall association). |
15. Chronic social deprivation can cause a decline in mental health. |
~10-12% |
~60-65% |
Yes (Killgore et al. 2020 on lockdown effects is a natural experiment. Intervention studies like Alvarez et al., Costello et al., Lai et al. on increasing social interaction improve MH and are likely field-based). |
Yes (Cacioppo et al. longitudinal study on isolation and depression). |
17. Social media increases visual social comparisons among adolescent girls. |
~2-3% |
~15-20% |
No (Cited experimental studies like Fardouly et al. 2015, Kleemans et al. 2018, Tiggemann & Anderberg 2020 are lab/online, not field studies measuring social comparison as an outcome in a consequential real-world setting). |
Yes (Couture Bue 2020 eye-tracking study; Fardouly et al. 2017 survey). |
18. Social media increases perfectionism among adolescent girls. |
~2-3% |
~5-8% |
No (Consensus states no experimental studies directly manipulating social media for this. Vanhoffelen et al. 2025 longitudinal study showed weak evidence). |
Yes (Surveys by Danielsen et al. 2024, Simon et al. 2022, Harren et al. 2021). |
19. Social media increases relational aggression among adolescent girls… |
~2-3% |
~5-7% |
No (Consensus states no experimental evidence for SM use -> increased RA. Martins & Weaver meta-analysis included experiments on “exposure to RA -> RA behavior,” not direct SM manipulation). |
Yes (Marinoni et al. 2023 on cyberbullying vulnerability). |
20. Among adolescent girls, social media increases exposure to other people displaying or discussing their mental disorders. |
~3-5% |
~15-20% |
No (Consensus states no direct causal link references. Cited studies are observational, e.g., phenomena on TikTok, Twitter disclosures). |
Yes (Alho et al. on peer networks; Hamilton et al. on self-harm content exposure). |
21. Social media increases sexual predation and harassment of adolescent girls… |
~1-2% |
~10-15% |
Potentially (Bhuller et al. 2013 quasi-experimental study on general internet access and sex crimes is a natural experiment, but not specific to SM mechanisms for predation). Consensus states no SM-specific experimental studies. |
Yes (Bratton et al. 2020 on Facebook use and online harassment). |
24. If most parents waited… it would benefit the mental health of adolescents overall. |
~2-3% |
~5-8% |
No (Cited studies are correlational/longitudinal. No experimental field studies manipulating delayed access). |
Yes (Dempsey et al., Vaterlaus et al., Gerosa et al., Sapien Labs, Sun 2023, all observational). |
25. Imposing (and enforcing) a legal minimum age of 16… would benefit the mental health of adolescents overall. |
~1-2% |
~5-7% |
No (Consensus states no strictly related references. Cited experimental work like Ferguson 2024 is on temporary abstention, not age limits). |
Yes (General research linking SM to decreased MH like Twenge & Campbell 2018 cited as rationale, but it’s not testing the policy). |
26. Phone-free schools would benefit the mental health of adolescents overall. |
~5-7% |
~10-15% |
Yes (Studies of school phone bans cited in Böttger & Zierer 2024 meta-analysis, King et al. 2024, Campbell et al. 2024 review, Goodyear et al. 2025 meta-analysis are natural experiments in field settings). |
No (The studies cited are evaluations of bans, not general observational studies of phone use in schools). |
Notes on Estimation from Figure 3:
The percentages for “% Causal Field” and “% Causal Field or Higher” are visual estimations from the provided stacked bar chart (Figure 3, p. 11 of the PDF). They are approximate due to the nature of reading from a graph.
Key Revisions and Observations:
- The table now includes an estimated “% Causal Field” directly from the figure.
- The assessment specifically looks for experimental field studies, natural experiments in the field, or well-controlled designs in the field that Section S5 cites as supporting the causal claim.
- An additional column notes if Section S5 cites observational studies as evidence for the causal direction (often longitudinal or strong associational data).
- For many claims, especially those about “Social media increases X,” direct experimental field evidence cited in S5 is often lacking, even if observational data or lab experiments are mentioned. The expert ratings for “Causal Field” for these claims often seem to reflect a broader interpretation or knowledge base beyond what’s explicitly detailed as experimental field research in the S5 summary.
- The policy-related claims (#24, #25) still show a lack of direct experimental field testing of the policies themselves, though #26 (phone-free schools) has natural experiment data.
Response
Okay, this is a detailed task. I will go through each of the identified causal claims, extract the expert assessment of evidence levels, and then analyze the studies cited in the consensus statement for that claim to assess their nature (causally informative, field, replication) and identify any conflicts.
The definition of “Field evidence” provided in the document (page 24) is: “Evidence collected in real-world, consequential settings using objective measures (e.g., official statistics in real-world settings, such as suicide rates; self-reported surveys are not field evidence).” This is a strict definition. “Causally informative” will generally refer to experimental studies (lab, field, or well-designed interventions).
Let’s start with Claim 17 as requested, then proceed through the other causal claims.
Causal Claims from Table 1:
* General Causal Claims: 8, 9, 10, 11, 12, 13, 14, 15
* Gender-related Causal Claims: 17, 18, 19, 20, 21
* Policy Recommendation Causal Claims: 24, 25, 26
Claim 17. Social media increases visual social comparisons among adolescent girls. (Consensus statement: p. 98-100)
- Expert Assessment of Evidence Level (N=96):
- No evidence: 14.58%
- Correlational: 57.29%
- Causal limited (lab, surveys, online, self-reported): 44.79%
- Causal Field: 11.46%
- Replicated Causal Field: 12.50%
- Wide-scale Causal: 8.33%
- Analysis of Cited Studies in Consensus Statement for Claim 17:
- Saiphoo & Vahedi (2019): Meta-analysis of 63 samples. Found social media use associated with decreased body satisfaction. Gender not a significant moderator. (This is correlational in nature for the primary studies it likely aggregates, though meta-analyses can include experimental work).
- Fioravanti et al. (2022): Systematic review of experimental studies. Indicated idealized images increased body dissatisfaction, with appearance comparison as a mediator. (Causally informative - experimental).
- Couture Bue (2020): Eye-tracking study. Showed Instagram use predicted greater attention to body regions associated with anxiety, effect mediated by appearance comparisons and body dissatisfaction. (Observational/correlational, though eye-tracking provides objective attention measures).
- Goodyear et al. (2022): Qualitative studies. Highlights how girls use social media to compare bodies. (Descriptive, not primarily causal).
- Fardouly et al. (2015): Experimental study. Assigned young women to browse Facebook, magazines, or websites. Found no significant Facebook effect on body dissatisfaction overall, but identified a moderating role of upward social comparison (those high in it showed increased discrepancy after Facebook). (Causally informative - experimental).
- Kleemans et al. (2018): Experimental study. Exposed adolescent girls to edited Instagram content. Girls with higher comparison tendency were more strongly affected. (Causally informative - experimental).
- Tiggemann & Anderberg (2020): Experimental study. Viewing “Instagram vs. reality” content reduced body dissatisfaction compared to idealized images. Appearance comparison did not significantly vary across conditions. (Causally informative - experimental).
- Ferguson (2018): Cited as a hypothesis that social media reminds women of body dissatisfaction. (Likely a review/theoretical piece, not primary causal data).
- Roberts et al. (2022): Stated as finding “social media contributes to body dissatisfaction, beyond traditional media.” The nature of “contributes” is key; if based on predictive modeling of non-experimental data, it’s correlational.
- Fardouly et al. (2017): Found upward comparisons on social media are associated with worse body image outcomes compared to in-person or traditional media. (Correlational).
- Assessment of Cited Studies:
- Causally Informative: Fioravanti et al. (2022 review of experiments), Fardouly et al. (2015 experiment), Kleemans et al. (2018 experiment), Tiggemann & Anderberg (2020 experiment). These are primarily lab/online experiments.
- Field Studies (as per document definition): None of the cited experimental studies appear to be field studies under the strict definition (real-world, consequential settings, objective measures like official statistics).
- Direct Replications: The experimental studies address similar constructs but use different specific manipulations and measures; they are not direct replications of each other.
- Conflict Assessment:
- The expert assessment of “Causal limited” (44.79%) is well-supported by the citation of several experimental studies.
- The expert ratings for “Causal Field” (11.46%), “Replicated Causal Field” (12.50%), and “Wide-scale Causal” (8.33%) seem less directly supported by the explicitly described experimental studies in the consensus text, as these are mostly lab/online experiments, not field studies.
- The consensus statement concludes that evidence suggests an association between social media use and body dissatisfaction, with the role of increased visual social comparison as the direct causal link still under consideration and likely moderated. This aligns with the mixed findings from the experimental studies regarding the specific mechanism of visual social comparison.
Now, for the other causal claims:
Claim 8. Heavy daily use of smartphones and social media can cause sleep deprivation. (Consensus statement: p. 77-79)
- Expert Assessment of Evidence Level (N=105):
- No evidence: 4.76%
- Correlational: 64.76%
- Causal limited: 57.14%
- Causal Field: 20.00%
- Replicated Causal Field: 8.57%
- Wide-scale Causal: 9.52%
- Analysis of Cited Studies:
- Correlational: Alonzo et al. (2021), Cheng et al. (2024), Alimoradi et al. (2019), Orben & Przybylski (2020), Gjoneska et al. (2022), Brosnan et al. (2024), Ahmed et al. (2024 meta-analysis of largely correlational studies), Sumter et al. (2024).
- Longitudinal (observational): Rod et al. (2018), Liu et al. (2019).
- Intervention studies (reducing screen time): Martin et al. (2020), Perrault et al. (2019), Tu et al. (2023). (Causally informative). These could be considered field experiments depending on setting and measures.
- Laboratory experiments (biological mechanisms): Chang et al. (2015) - light emission. (Causally informative).
- Meta-analysis of interventions: Schechter et al. (2022) - blue light reduction. (Causally informative).
- Consensus panel review: Hartstein et al. (2024) - reviewed experimental and intervention studies. (Summarizes causally informative work).
- Assessment of Cited Studies:
- Causally Informative: Martin et al. (2020), Perrault et al. (2019), Tu et al. (2023), Chang et al. (2015), Schechter et al. (2022), Hartstein et al. (2024 review).
- Field Studies: The intervention studies (Martin, Perrault, Tu) could qualify if conducted in naturalistic settings with objective sleep measures (though some might use self-report). The consensus text doesn’t detail their methodology to confirm field status per the document’s definition.
- Direct Replications: The intervention studies are conceptually similar but likely not direct replications.
- Conflict Assessment:
- The high expert rating for “Correlational” (64.76%) and “Causal limited” (57.14%) is well-supported by the cited literature. The “Causal Field” (20.00%) rating is plausible given the intervention studies.
- No major conflict. The consensus accurately reflects a mix of correlational and some experimental/interventional evidence. The conclusion that the extent to which it specifically causes sleep deprivation (vs. general sleep problems) is unclear seems justified.
Claim 9. Chronic sleep deprivation can cause a decline in mental health. (Consensus statement: p. 81)
- Expert Assessment of Evidence Level (N=100):
- No evidence: 1.00%
- Correlational: 44.44% (Note: this should be “Correlational” as per the question options for causal claims, the text lists it as 44.00% without the label but it’s the first option after “No evidence”).
- Causal limited: 45.00%
- Causal field: 26.00%
- Replicated causal field: 30.00%
- Wide-scale causal: 44.00%
- Analysis of Cited Studies:
- Pires et al. (2016): Sleep deprivation shown to increase state anxiety. (Likely experimental if “shown to increase”).
- Roberts & Duong (2014): Six or fewer hours of sleep predicts higher depression. (Longitudinal observational, predictive but not strictly experimental manipulation of sleep deprivation).
- Liu et al. (2020): Review of 41 longitudinal studies; sleep disturbances predictors of suicidal ideation. (Summarizes observational longitudinal work).
- Scott et al. (2021): Meta-analysis of sleep-improvement interventions. Improving sleep had positive effect on mental health. (Causally informative).
- Short et al. (2022): Highlights vulnerability of adolescents. (Likely review/conceptual).
- McEwen & Lasley (2002): Biological mechanisms – sleep deprivation disrupts brain areas. (Reviews of mechanistic studies, some of which could be experimental).
- Assessment of Cited Studies:
- Causally Informative: Scott et al. (2021 meta-analysis of interventions), Pires et al. (2016, likely experimental). Mechanistic studies discussed by McEwen & Lasley (2002) could also be experimental.
- Field Studies: The interventions reviewed by Scott et al. might include field studies.
- Direct Replications: Unclear from the summary.
- Conflict Assessment:
- The expert ratings are quite high across “Causal limited” (45%), “Replicated causal field” (30%), and “Wide-scale causal” (44%).
- The cited evidence strongly supports a causal link, especially through the meta-analysis of interventions (Scott et al.). The biological mechanism work also supports causality.
- The consensus is strong. There isn’t a clear conflict; the evidence cited (especially interventions) supports the higher-level causal inferences made by experts.
Claim 10. Heavy daily use of smartphones and social media can cause attention fragmentation. (Consensus statement: p. 82-83)
- Expert Assessment of Evidence Level (N=88):
- No evidence: 9.09%
- Correlational: 63.64%
- Causal limited: 57.95%
- Causal field: 9.09%
- Replicated causal field: 5.68%
- Wide-scale causal: 7.95%
- Analysis of Cited Studies:
- Santos et al. (2022): Systematic review; excessive screen time in children can be associated with attention problems. (Reviews largely correlational work).
- Barry et al. (2017): Observed an association between number of social media accounts and DSM-5 inattention symptoms. (Correlational).
- Longitudinal studies on smartphone use and ADHD symptoms: George et al. (2017) - association; Ra et al. (2018) - high social media use associated with later ADHD symptoms; Deng et al. (2024) - online social activities linked to ADHD symptoms. (Observational longitudinal).
- Studies on “mere presence” of smartphone: Ward et al. (2017), Skowronek et al. (2023) - presence could reduce cognitive capacity. (Experimental, causally informative for “mere presence” not “heavy use”).
- Parry (2024): Meta-analysis; mere presence negatively impacted working memory. (Reviews experimental work on “mere presence”).
- van Wezel et al. (2021): Experimental study (reducing social media); no effect on attention (manipulation failed). (Causally informative, but with null/inconclusive result due to design issue).
- Assessment of Cited Studies:
- Causally Informative: Ward et al. (2017), Skowronek et al. (2023), Parry (2024 meta-analysis of these types of studies), van Wezel et al. (2021). These mostly relate to “mere presence” or effects of reduction, not directly “heavy use causes fragmentation.”
- Field Studies: van Wezel et al. was an intervention that could be a field study.
- Direct Replications: Not apparent.
- Conflict Assessment:
- Experts rate “Correlational” (63.64%) and “Causal limited” (57.95%) high.
- The cited literature is predominantly correlational or observational longitudinal, especially regarding “heavy use.” The experimental evidence cited (Ward, Skowronek, Parry) focuses on the “mere presence” effect, which is different from “heavy daily use.”
- The consensus statement itself notes the definitional issues with “attention fragmentation” vs. “inattention/ADHD.” The conclusion that “the evidence is not robust enough to definitively confirm a causal link” for heavy use causing attention fragmentation seems justified given the cited studies. The expert rating for “Causal limited” might be slightly generous if strictly applied to “heavy use causing fragmentation” as opposed to related concepts.
Claim 11. Attention fragmentation can cause a decline in mental health (possibly through mediating factors such as its negative impact on social relationships). (Consensus statement: p. 84-85)
- Expert Assessment of Evidence Level (N=54):
- No evidence: 29.62%
- Correlational: 48.15%
- Causal limited: 25.93%
- Causal field: 7.41%
- Replicated causal field: 7.41%
- Wide-scale causal: 5.55%
- Analysis of Cited Studies: (The statement notes “attention fragmentation” is not well-established, so it discusses related constructs).
- Mind-wandering: Killingsworth & Gilbert (2010) - mind-wandering associated with unhappiness (correlational). Smallwood & Schooler (2015) - review. Poerio et al. (2013), Welz et al. (2018) - null or positive effects of mind-wandering (correlational).
- Cognitive load: Mizuno et al. (2011), Hawthorne et al. (2019) - link between cognitive load and negative well-being (correlational). Hu et al. (2021) - cognitive load may increase well-being in some cases (potentially experimental or quasi-experimental).
- Multitasking: Becker et al. (2013), Shin et al. (2024) - media multitasking associated with higher depression/anxiety (correlational). Hatchel et al. (2018) - associated with negative affect/low self-esteem (correlational).
- Intervention studies (disabling notifications): Fitz et al. (2019), Olson et al. (2023) - positive effects on well-being (causally informative). Dekker et al. (2024), Liao et al. (2022) - drawbacks like increased checking/anxiety (causally informative, but for a different outcome).
- Assessment of Cited Studies:
- Causally Informative: Fitz et al. (2019), Olson et al. (2023), Dekker et al. (2024), Liao et al. (2022) – these are interventions on notifications, indirectly related to fragmentation. Hu et al. (2021) might be.
- Field Studies: The intervention studies might qualify.
- Direct Replications: No.
- Conflict Assessment:
- Expert ratings are highest for “Correlational” (48.15%) and relatively low for causal levels. “No evidence” is also high (29.62%).
- The cited literature is heavily correlational for the link between related constructs (mind-wandering, cognitive load, multitasking) and mental health. The experimental evidence on interventions (disabling notifications) is mixed and indirect.
- The consensus statement’s conclusion that “evidence regarding the effect of attention fragmentation on mental health is preliminary” aligns well with the cited studies and the expert assessment. No major conflict.
Claim 12. Heavy daily use of smartphones and social media can cause behavioral addiction. (Consensus statement: p. 87-88)
- Expert Assessment of Evidence Level (N=94):
- No evidence: 9.57%
- Correlational: 64.89%
- Causal limited: 48.93%
- Causal field: 19.15%
- Replicated causal field: 10.64%
- Wide-scale causal: 11.70%
- Analysis of Cited Studies:
- Truzoli et al. (2023): Social media users with high Internet Addiction Test scores reported lower distress after ceasing internet session (consistent with mobile device use as sedative). (Correlational/observational).
- Anderson & Wood (2023): Frequent social media users show blunted sensitivity to social rewards (habituation). (Observational, potentially experimental if reward sensitivity was tested).
- Allcott et al. (2022): Field study suggesting social media usage may involve self-control problems and be habit-forming. (Causally informative – likely an intervention or natural experiment).
- TikTok internal study (NPR, 2024): 35 mins of videos sufficient to develop a habit. (Internal industry data, nature of study unclear but suggests rapid habit formation).
- Neuroimaging studies (He et al., 2017; Montag et al., 2017): Parallels in brain activity between problematic smartphone use and other behavioral addictions. (Correlational/Observational).
- Books on dopamine/addiction (Courtwright, 2019; Lembke, 2021). Design features (Schüll, 2012; Sherman et al., 2016; Lindström et al., 2021). (Theoretical/Mechanistic arguments).
- Assessment of Cited Studies:
- Causally Informative: Allcott et al. (2022) is explicitly called a field study and its findings (self-control, habit) are causally suggestive.
- Field Studies: Allcott et al. (2022).
- Direct Replications: No.
- Conflict Assessment:
- Experts rate “Correlational” (64.89%) and “Causal limited” (48.93%) high. “Causal field” is 19.15%.
- The consensus statement itself notes “no experts provided experimental evidence about this claim” and “experimental evidence is virtually absent.” This seems to contradict the mention of Allcott et al. (2022) as a field study suggesting habit formation. However, “habit formation” and “self-control problems” are not identical to “causing behavioral addiction” as a clinical disorder.
- There’s a slight tension: experts rate some level of causal evidence, one field study with suggestive findings is cited, but the consensus text emphasizes the absence of direct experimental proof for addiction as a disorder. The conclusion that “preliminary correlational evidence” exists but “experimental evidence is virtually absent” is largely accurate for addiction as a formally recognized disorder, even if related phenomena like habit formation have some experimental backing.
Claim 13. Behavioral addiction can cause a decline in mental health. (Consensus statement: p. 90)
- Expert Assessment of Evidence Level (N=81):
- No evidence: 12.35%
- Correlational: 50.62%
- Causal limited: 41.96%
- Causal field: 19.75%
- Replicated causal field: 23.46%
- Wide-scale causal: 34.57%
- Analysis of Cited Studies:
- Grant et al. (2010): Review; behavioral addictions share similarities with substance addictions (co-occurring disorders).
- Petry et al. (2005): High comorbidity between pathological gambling and psychiatric disorders (depression, anxiety). (Cross-sectional/Correlational).
- Ko et al. (2009): Linked excessive online gaming to depression, anxiety, social phobia. (Likely correlational).
- Hartmann & Blaszczynski (2018): Gambling problems can lead to increased depression/anxiety over time. (Longitudinal, suggesting directionality).
- Marchant et al. (2017): Review; linking “internet-related problematic behaviors” to self-harm/suicidal behaviors, but direction of causality unclear.
- Alimoradi et al. (2024): Meta-analysis; problematic internet-related behaviors consistently associated with mental health issues. (Aggregates largely correlational data).
- Assessment of Cited Studies:
- Causally Informative: Hartmann & Blaszczynski (2018) is longitudinal and suggests directionality, which is a step towards causal inference but not a manipulated experiment. Most others are correlational or reviews of correlational work. The claim is broad (“behavioral addiction”), and some of these studies (gambling) have a longer history of research than, say, social media addiction.
- Field Studies: Not apparent from summaries.
- Direct Replications: No.
- Conflict Assessment:
- Experts rate “Correlational” (50.62%) and “Causal limited” (41.96%) highest, with “Wide-scale causal” also notably high (34.57%).
- The literature cited is largely based on associations, comorbidities, and some longitudinal data (gambling). The consensus states the claim is “true by definition” for DSM/ICD, which is a strong point.
- The conclusion that “the direction of the causal relationship remains unclear” (despite the definitional point) reflects the limitations of most primary research (often correlational) vs. the clinical definitions. No major conflict, but the high “Wide-scale causal” rating might be more based on the definitional aspect and long-standing clinical observation for established addictions (like gambling) rather than rigorous experimental proof for all behavioral addictions causing mental health decline.
Claim 14. Heavy daily use of smartphones and social media can cause social deprivation, such as isolation and lack of formative social experiences. (Consensus statement: p. 92-93)
- Expert Assessment of Evidence Level (N=83):
- No evidence: 13.25%
- Correlational: 63.86%
- Causal limited: 42.17%
- Causal field: 14.46%
- Replicated causal field: 9.64%
- Wide-scale causal: 8.43%
- Analysis of Cited Studies:
- Meta-analyses on social media & loneliness: Appel et al. (2020), Liu & Baumeister (2016) - found positive correlation. Cheng et al. (2019) - largest meta-analysis, no overall association.
- Geraci et al. (2022): Quasi-experimental; higher connection speeds/internet use can lead to reduced social engagement. (Causally suggestive).
- Allcott et al. (2020): Experimental (deactivating Facebook); results in more in-person socialization but also more solitary TV. (Causally informative).
- Pew Research (Anderson & Jiang, 2018): Survey; 81% teens feel more connected via social media. (Correlational, self-report).
- Steinsbekk et al. (2024): Social media use associated with more time with friends offline. (Correlational/Longitudinal).
- Studies on benefits for LGBTQ+ youth (Berger et al., 2022; Paceley et al., 2022) or students with disabilities.
- Wang et al. (2018): Longitudinal; U-shaped relationship between active Facebook use and loneliness.
- Displacement hypothesis studies (Verduyn et al., 2021; Dienlin et al., 2017; Hall et al., 2019a, 2019b) - mixed evidence, social media may displace other entertainment not just social interaction.
- Assessment of Cited Studies:
- Causally Informative: Geraci et al. (2022, quasi-experimental), Allcott et al. (2020, experimental).
- Field Studies: Allcott et al. (2020) was a field experiment.
- Direct Replications: No.
- Conflict Assessment:
- Experts rate “Correlational” (63.86%) and “Causal limited” (42.17%) high.
- The cited literature is mixed: correlational studies go both ways, meta-analyses are mixed, and a couple of experimental/quasi-experimental studies show some causal effects (both positive and negative for social connection aspects).
- The consensus conclusion that the “strength and even the direction of the potential causal relationship…likely depends on various…factors” and that future work should explore moderators aligns well with the mixed evidence and expert ratings. No major conflict.
Claim 15. Chronic social deprivation can cause a decline in mental health. (Consensus statement: p. 95)
- Expert Assessment of Evidence Level (N=91):
- No evidence: 1.10%
- Correlational: 49.45%
- Causal limited: 51.65%
- Causal field: 32.97%
- Replicated causal field: 27.47%
- Wide-scale causal: 37.36%
- Analysis of Cited Studies:
- National health surveys (NHS, Newlove-Delgado et al., 2022): report associations between social deprivation and poor mental health. (Correlational).
- Cacioppo et al. (2006): Longitudinal; demonstrated increased depression in socially isolated participants. (Observational longitudinal).
- Dickerson & Kemeny (2004): Social-evaluative stressors associated with cortisol levels. (Likely review of experimental and correlational work).
- Field studies/Natural experiments (COVID-19): Killgore et al. (2020) - analyzed mental health data across lockdown, found increased depression/suicidal ideation associated with extended social isolation. (Natural experiment, observational).
- Experimental studies: Alvarez et al. (2024), Costello et al. (2022), Lai et al. (2020) - increasing social interactions improves mental health outcomes. (Causally informative - interventions).
- Holt-Lunstad (2024): Review; social connections a key factor.
- Assessment of Cited Studies:
- Causally Informative: Alvarez et al. (2024), Costello et al. (2022), Lai et al. (2020) are intervention studies. The natural experiment by Killgore et al. (2020) is also causally suggestive.
- Field Studies: The interventions might qualify. Killgore et al. is a natural field study.
- Direct Replications: Not apparent.
- Conflict Assessment:
- Experts rate “Causal limited” (51.65%) high, with substantial ratings for “Causal field” (32.97%) and “Wide-scale causal” (37.36%).
- The cited literature includes strong correlational, longitudinal, natural experimental, and intervention studies. The consensus itself notes the converging evidence but points out the ethical impossibility of depriving people of social interaction in controlled experiments for this specific claim.
- The statement’s conclusion that “experts agreed that the evidence supports the claim” is consistent. The evidence is strong, even if direct experimental deprivation isn’t ethically feasible. No major conflict.
Claim 18. Social media increases perfectionism among adolescent girls. (Consensus statement: p. 101-102)
* Expert Assessment of Evidence Level (N=72):
* No evidence: 23.61%
* Correlational: 63.89%
* Causal limited: 27.78%
* Causal field: 9.72%
* Replicated causal field: 5.55%
* Wide-scale causal: 5.55%
- Analysis of Cited Studies:
- Danielsen et al. (2024): Survey; upward social comparison associated with higher perfectionism (boys & girls). (Correlational).
- Simon et al. (2022): Survey; self-reported Instagram addiction positively correlated with concerns about physical imperfections. (Correlational).
- Harren et al. (2021): Survey; problematic Instagram/Facebook uses linked to perfectionism. (Correlational).
- Vanhoffelen et al. (2025): Longitudinal study; weak evidence that exposure to superior individuals on social media increases perfectionism. (Observational longitudinal).
- Broader research on rising perfectionism (Curran & Hill, 2019) - not specific to social media.
- Assessment of Cited Studies:
- Causally Informative: The consensus states “no evidence from experimental studies directly manipulating social media exposure.” Vanhoffelen et al. (2025) is longitudinal, which offers more than cross-sectional but isn’t experimental.
- Field Studies: None cited.
- Direct Replications: No.
- Conflict Assessment:
- Experts rate “Correlational” (63.89%) highest, with “Causal limited” at 27.78%. “No evidence” is also substantial (23.61%).
- The cited primary research specific to social media and perfectionism is correlational or longitudinal with weak findings. The consensus explicitly states the lack of experimental evidence.
- The conclusion that “social media may be associated with perfectionism” and that future studies should “explore potential causal links” aligns perfectly with the cited evidence and expert assessments. No conflict.
Claim 19. Social media increases relational aggression among adolescent girls, for example by providing tools for cyberbullying and exclusion. (Consensus statement: p. 103-104)
- Expert Assessment of Evidence Level (N=71):
- No evidence: 19.72%
- Correlational: 69.01%
- Causal limited: 30.99%
- Causal field: 8.45%
- Replicated causal field: 1.41%
- Wide-scale causal: 4.23%
- Analysis of Cited Studies:
- Marinoni et al. (2023): Found girls more vulnerable to cyberbullying/victimization than boys during pandemic. (Correlational, specific context).
- Martins & Weaver (2019): Meta-analysis (surveys, longitudinal, experimental); exposure to relational aggression strongly predicted relationally aggressive behaviors. (This meta-analysis includes experimental studies, making it causally informative for the “exposure to RA -> RA behavior” link, but not directly “social media use -> increases RA”).
- Kvardova et al. (2025): Higher pro-social social media use positively related to increased prosocial behaviors. (Correlational, opposite direction for prosocial).
- Hinduja & Patchin (2014): Argues social media features may amplify relational aggression. (Theoretical/Argumentative).
- Lapidot-Lefler et al. (2012): Lab study (adults); lack of eye contact can contribute to disinhibition/toxic behavior. (Experimental, but on adults and general toxic behavior, not specific to social media or relational aggression in girls).
- Assessment of Cited Studies:
- Causally Informative: Martins & Weaver (2019) meta-analysis includes experimental studies, but the link is “exposure to RA -> RA behaviors”. Lapidot-Lefler et al. (2012) is experimental but indirect. The consensus notes “no experts provided experimental evidence establishing a causal link between social media use and increased relational aggression.”
- Field Studies: None cited directly.
- Direct Replications: No.
- Conflict Assessment:
- Experts rate “Correlational” (69.01%) highest, with “Causal limited” (30.99%) next.
- The primary evidence cited is largely correlational or indirect. The experimental evidence from Martins & Weaver is about exposure to RA, not social media causing RA.
- The consensus conclusion that “the evidence that social media use increases relational aggression among adolescent girls is preliminary” is well-supported. No major conflict.
Claim 20. Among adolescent girls, social media increases exposure to other people displaying or discussing their mental disorders. (Consensus statement: p. 105-106)
- Expert Assessment of Evidence Level (N=72):
- No evidence: 18.06%
- Correlational: 69.44%
- Causal limited: 27.78%
- Causal field: 12.50%
- Replicated causal field: 9.72%
- Wide-scale causal: 8.33%
- Analysis of Cited Studies:
- Alho et al. (2024): Mental disorders may be socially transmitted in adolescent peer networks (classmates). (Observational, not specific to social media).
- Studies on tic-like behaviors spreading on TikTok: Frey et al. (2022), Giedinghagen (2023), Haltigan et al. (2023), Hull et al. (2021), Olvera et al. (2021). (Observational, case reports, descriptive of a phenomenon).
- Ernala et al. (2018): Research on Twitter suggests reciprocal behavior in self-disclosure of mental illness. (Observational/Content analysis).
- Hamilton et al. (2025): Exposure to self-harm content associated with greater likelihood of urges/engaging in self-harm. (Correlational/Longitudinal).
- Ali et al. (2024): Analyzed DMs about self-harm/suicide on Instagram; found significant helping behavior. (Observational/Content analysis, counter-example).
- Assessment of Cited Studies:
- Causally Informative: The consensus states “no experts provided references for a direct causal link.” The studies are observational, content analyses, or describe phenomena (tic-like behaviors).
- Field Studies: None in the experimental sense.
- Direct Replications: No.
- Conflict Assessment:
- Experts rate “Correlational” (69.44%) highest, with “Causal limited” (27.78%) next.
- The cited literature describes associations and observed phenomena but does not establish social media causes increased exposure in an experimental sense (though it’s highly plausible by nature of the platforms).
- The conclusion that future studies should “explore potential causal links” and notes the lack of direct causal link evidence is consistent. No major conflict.
Claim 21. Social media increases sexual predation and harassment of adolescent girls, for example by providing predators with access to potential victims. (Consensus statement: p. 107-108)
- Expert Assessment of Evidence Level (N=59):
- No evidence: 28.81%
- Correlational: 52.54%
- Causal limited: 13.56%
- Causal field: 5.08%
- Replicated causal field: 3.39%
- Wide-scale causal: 11.86%
- Analysis of Cited Studies:
- Common Sense Media survey (Nesi et al., 2023): Girls report being contacted by strangers. (Self-report survey).
- Journalistic investigations (WSJ, 2024a, 2023): Documenting harassment and platforms facilitating illegal behavior.
- Internal Meta survey (Instagram, 2021), Snap study (WSJ, 2024b): Internal industry data on unwanted advances/sextortion.
- Bratton et al. (2020): Correlational study; association between Facebook use and online harassment.
- Bhuller et al. (2013): Quasi-experimental; increased internet access associated with rise in sex crimes (linked to pornography). (Causally suggestive for general internet, not specific social media mechanisms for predation).
- Assessment of Cited Studies:
- Causally Informative: The consensus states “no experts provided experimental studies manipulating social media usage and measuring its effect on harassment or sexual predation.” Bhuller et al. (2013) is quasi-experimental for general internet access.
- Field Studies: Bhuller et al. could be considered a natural field experiment.
- Direct Replications: No.
- Conflict Assessment:
- Experts rate “Correlational” (52.54%) highest. “No evidence” is also high (28.81%). Causal ratings are low.
- The cited evidence consists of surveys, journalistic reports, internal industry data, and one quasi-experimental study on general internet access. There’s a lack of direct experimental evidence on social media causing increased predation.
- The conclusion to “investigate whether a causal link exists” is appropriate. No major conflict.
Claim 24. If most parents waited until their children were in high school to give them their first smartphones, it would benefit the mental health of adolescents overall. (Consensus statement: p. 112-113)
- Expert Assessment of Evidence Level (N=93):
- No evidence: 50.68%
- Correlational: 39.73%
- Causal limited: 26.03%
- Causal field: 8.22%
- Replicated causal field: 2.74%
- Wide-scale causal: 5.48%
- Analysis of Cited Studies:
- Dempsey et al. (2020): Found no significant links between early mobile phone ownership and psychosocial outcomes. (Correlational/Longitudinal).
- Vaterlaus et al. (2021): Age of first smartphone had little to no predictive value for later well-being. (Correlational/Longitudinal).
- Gerosa et al. (2024): Contrasting relationships by gender; early ownership positive for girls’ life satisfaction, negative for boys. (Correlational/Longitudinal).
- Sapien Labs (2024): Large survey; mental health outcomes associated with later age of first smartphone/tablet, more pronounced for females. (Correlational).
- Sun (2023): Longitudinal; no significant associations between age of phone acquisition and depression, academic performance, sleep. (Longitudinal).
- Assessment of Cited Studies:
- Causally Informative: None of the cited studies are experimental manipulations of delaying smartphone access. They are all observational/longitudinal.
- Field Studies: No.
- Direct Replications: No.
- Conflict Assessment:
- Experts rate “No evidence” (50.68%) highest, followed by “Correlational” (39.73%). Causal ratings are low.
- The cited literature is entirely correlational/longitudinal with mixed findings.
- The consensus conclusion that “the evidence is insufficient to draw conclusions about this claim” is well-supported. No conflict.
Claim 25. Imposing (and enforcing) a legal minimum age of 16 for opening social media accounts would benefit the mental health of adolescents overall. (Consensus statement: p. 114-115)
- Expert Assessment of Evidence Level (N=68):
- No evidence: 70.59%
- Correlational: 26.47%
- Causal limited: 10.29%
- Causal field: 2.94%
- Replicated causal field: 2.94%
- Wide-scale causal: 5.88%
- Analysis of Cited Studies:
- The consensus states: “Experts provided no references that were strictly related to the claim”.
- It then cites general research linking social media to decreased mental health (Twenge & Campbell, 2018; Orben et al., 2022).
- Ferguson (2024) meta-analysis: No significant mental health differences between experimental abstention and control. (Causally informative, but about temporary abstention, not age limits).
- Thrul et al. (2025) re-analysis: Duration of reduction matters. (Methodological re-evaluation of existing studies).
- Assessment of Cited Studies:
- Causally Informative: The consensus explicitly states no direct evidence was provided for this specific policy. Ferguson (2024) and Thrul et al. (2025) are about effects of reducing use, not delaying initial access via age limits.
- Field Studies: No.
- Direct Replications: No.
- Conflict Assessment:
- Experts rate “No evidence” (70.59%) overwhelmingly highest. All other ratings are very low.
- This aligns perfectly with the consensus statement noting the lack of direct evidence. The discussion appropriately shifts to practical/legal/ethical issues and alternative approaches due to this evidentiary gap. No conflict.
Claim 26. Phone-free schools would benefit the mental health of adolescents overall. (Consensus statement: p. 116-117)
- Expert Assessment of Evidence Level (N=77):
- No evidence: 28.57%
- Correlational: 44.16%
- Causal limited: 25.97%
- Causal field: 16.88%
- Replicated causal field: 2.60%
- Wide-scale causal: 11.69%
- Analysis of Cited Studies:
- Böttger & Zierer (2024): Meta-analysis of 5 studies; small significant overall effect, notable benefits in social well-being (bullying). (Synthesizes existing, likely mixed-methodology studies).
- King et al. (2024): Pre-registered study (Australia); no meaningful changes in problematic phone use, engagement, belonging after phone ban. (Observational, natural experiment).
- Campbell et al. (2024): Scoping review; highlighted limitations, lack of RCTs. Small majority of studies reported positive effects on bullying/cyberbullying, similar small majority found no significant impact on mental health/well-being.
- Goodyear et al. (2025): Meta-analysis; comparing restrictive vs. permissive policies, no significant differences in mental health/performance (but only 4/20 schools fully implemented phone-free).
- Assessment of Cited Studies:
- Causally Informative: Studies included in these reviews/meta-analyses are likely observational or quasi-experimental (natural experiments of school bans). King et al. (2024) is a natural experiment. The consensus notes a lack of RCTs.
- Field Studies: School ban studies are inherently field studies.
- Direct Replications: No, methodologies vary.
- Conflict Assessment:
- Experts rate “Correlational” (44.16%) highest, with “No evidence” (28.57%) and “Causal limited” (25.97%) also substantial.
- The cited reviews and studies show mixed and limited evidence. Methodological issues abound.
- The consensus conclusion that “available evidence is too limited and inconsistent to draw conclusions” is strongly supported by the nature of the cited research and the expert ratings. No conflict.
You are absolutely correct. Claims like “Social media increases…” are indeed causal claims, even if they aren’t explicitly about causing a decline in mental health. I will re-evaluate with that in mind, focusing on the “Causal field” expert assessment for all identified causal claims and explicitly noting if any cited studies are experimental field studies.
Let’s go through them again.
Claim 8. Heavy daily use of smartphones and social media can cause sleep deprivation. (Consensus statement: p. 77-79)
- Expert Assessment of Evidence Level (N=105):
- Correlational: 64.76%
- Causal limited: 57.14%
- Causal Field: 20.00%
- Replicated Causal Field: 8.57%
- Wide-scale Causal: 9.52%
- Analysis of Cited Studies (Focus on Experimental Field Studies):
- Intervention studies (reducing screen time): Martin et al. (2020), Perrault et al. (2019), Tu et al. (2023). These are described as showing that reducing screen time can improve sleep. If these interventions were conducted in participants’ natural environments (e.g., homes, not a sleep lab) and measured sleep objectively (e.g., actigraphy) or even through validated diaries over a period, they could qualify as experimental field studies. The consensus text doesn’t provide enough detail on their methodology to definitively categorize them as such per the strict definition, but they are the strongest candidates.
- Laboratory experiments (Chang et al., 2015) are not field studies.
- Meta-analyses (Schechter et al., 2022; Hartstein et al., 2024) might include experimental field studies, but the summary doesn’t break this down.
- Experimental Field Studies Present? Potentially, through the intervention studies (Martin, Perrault, Tu).
- Expert Assessment of “Causal Field” Evidence: 20.00% of experts rated the evidence as “Causal Field.” This is plausible if the intervention studies are considered as such.
Claim 9. Chronic sleep deprivation can cause a decline in mental health. (Consensus statement: p. 81)
- Expert Assessment of Evidence Level (N=100):
- Correlational: 44.44%
- Causal limited: 45.00%
- Causal field: 26.00%
- Replicated causal field: 30.00%
- Wide-scale causal: 44.00%
- Analysis of Cited Studies (Focus on Experimental Field Studies):
- Scott et al. (2021): Meta-analysis of sleep-improvement interventions. Improving sleep had a positive effect on mental health. Some of these interventions could have been conducted in field settings (e.g., CBT-I delivered in a community setting with real-world mental health outcomes).
- Pires et al. (2016) (likely experimental) and mechanistic studies (McEwen & Lasley, 2002) are more likely lab-based.
- Experimental Field Studies Present? Potentially, within the interventions reviewed by Scott et al. (2021).
- Expert Assessment of “Causal Field” Evidence: 26.00%. This aligns with the possibility that some interventions reviewed were field-based.
Claim 10. Heavy daily use of smartphones and social media can cause attention fragmentation. (Consensus statement: p. 82-83)
- Expert Assessment of Evidence Level (N=88):
- Correlational: 63.64%
- Causal limited: 57.95%
- Causal field: 9.09%
- Replicated causal field: 5.68%
- Wide-scale causal: 7.95%
- Analysis of Cited Studies (Focus on Experimental Field Studies):
- van Wezel et al. (2021): Experimental study (reducing social media); no effect on attention (manipulation failed). This was an intervention in participants’ daily lives, so it could be considered an experimental field study.
- Studies on “mere presence” (Ward et al., 2017; Skowronek et al., 2023) are lab experiments.
- Experimental Field Studies Present? Yes, van Wezel et al. (2021) is a candidate.
- Expert Assessment of “Causal Field” Evidence: 9.09%. This is a low percentage, consistent with only one potential (and inconclusive) field experiment being cited.
Claim 11. Attention fragmentation can cause a decline in mental health… (Consensus statement: p. 84-85)
- Expert Assessment of Evidence Level (N=54):
- Correlational: 48.15%
- Causal limited: 25.93%
- Causal field: 7.41%
- Replicated causal field: 7.41%
- Wide-scale causal: 5.55%
- Analysis of Cited Studies (Focus on Experimental Field Studies):
- Intervention studies (disabling notifications): Fitz et al. (2019), Olson et al. (2023), Dekker et al. (2024), Liao et al. (2022). These interventions, if conducted in participants’ natural environments with real-world well-being measures, would be experimental field studies.
- Experimental Field Studies Present? Potentially, through the notification intervention studies.
- Expert Assessment of “Causal Field” Evidence: 7.41%. This is low, suggesting experts are cautious about the strength or directness of these field interventions for this specific broad claim.
Claim 12. Heavy daily use of smartphones and social media can cause behavioral addiction. (Consensus statement: p. 87-88)
- Expert Assessment of Evidence Level (N=94):
- Correlational: 64.89%
- Causal limited: 48.93%
- Causal field: 19.15%
- Replicated causal field: 10.64%
- Wide-scale causal: 11.70%
- Analysis of Cited Studies (Focus on Experimental Field Studies):
- Allcott et al. (2022): Explicitly called a field study suggesting social media usage may involve self-control problems and be habit-forming. This is an experimental field study (likely involving an intervention or random assignment in a real-world context).
- Experimental Field Studies Present? Yes, Allcott et al. (2022).
- Expert Assessment of “Causal Field” Evidence: 19.15%. This aligns with one prominent field experiment being cited.
Claim 13. Behavioral addiction can cause a decline in mental health. (Consensus statement: p. 90)
- Expert Assessment of Evidence Level (N=81):
- Correlational: 50.62%
- Causal limited: 41.96%
- Causal field: 19.75%
- Replicated causal field: 23.46%
- Wide-scale causal: 34.57%
- Analysis of Cited Studies (Focus on Experimental Field Studies):
- The consensus mentions Hartmann & Blaszczynski (2018) showed gambling problems can lead to increased depression/anxiety over time. While longitudinal, it’s observational rather than an experimental field study manipulating addiction to see its effect on mental health. The claim is broad (“behavioral addiction”). It’s difficult to ethically conduct experimental field studies where addiction is induced to observe mental health decline. Treatment studies (interventions for addiction showing improved mental health) would be more common and could be field-based, but aren’t the primary focus of this claim’s wording.
- Experimental Field Studies Present? Unlikely for causing decline through manipulation of addiction itself in a field setting. Intervention studies for addiction that measure mental health outcomes in the field would be relevant but focus on reversal.
- Expert Assessment of “Causal Field” Evidence: 19.75%. This might reflect interpretations of strong longitudinal data or broader clinical field observations rather than direct experimental field manipulations of causing decline.
Claim 14. Heavy daily use of smartphones and social media can cause social deprivation… (Consensus statement: p. 92-93)
- Expert Assessment of Evidence Level (N=83):
- Correlational: 63.86%
- Causal limited: 42.17%
- Causal field: 14.46%
- Replicated causal field: 9.64%
- Wide-scale causal: 8.43%
- Analysis of Cited Studies (Focus on Experimental Field Studies):
- Geraci et al. (2022): Quasi-experimental study; higher connection speeds/internet use can lead to reduced social engagement. This could be a natural experiment in the field.
- Allcott et al. (2020): Experimental study (deactivating Facebook); results in more in-person socialization but also more solitary TV. This was a field experiment.
- Experimental Field Studies Present? Yes, Allcott et al. (2020) and potentially Geraci et al. (2022) as a natural field experiment.
- Expert Assessment of “Causal Field” Evidence: 14.46%. This is consistent with a couple of relevant studies.
Claim 15. Chronic social deprivation can cause a decline in mental health. (Consensus statement: p. 95)
- Expert Assessment of Evidence Level (N=91):
- Correlational: 49.45%
- Causal limited: 51.65%
- Causal field: 32.97%
- Replicated causal field: 27.47%
- Wide-scale causal: 37.36%
- Analysis of Cited Studies (Focus on Experimental Field Studies):
- Killgore et al. (2020): Analyzed mental health data across lockdown periods (COVID-19). This is a natural experiment in a field setting.
- Experimental studies (Alvarez et al., 2024; Costello et al., 2022; Lai et al., 2020): Increasing social interactions improves mental health. These are interventions. If conducted in real-world settings with community participants, they would be experimental field studies.
- Experimental Field Studies Present? Yes, Killgore et al. (2020) as a natural field experiment, and potentially the intervention studies.
- Expert Assessment of “Causal Field” Evidence: 32.97%. This aligns well with the presence of a significant natural experiment and other intervention studies that are likely field-based.
Claim 17. Social media increases visual social comparisons among adolescent girls. (Consensus statement: p. 98-100)
- Expert Assessment of Evidence Level (N=96):
- Correlational: 57.29%
- Causal limited: 44.79%
- Causal Field: 11.46%
- Replicated Causal Field: 12.50%
- Wide-scale Causal: 8.33%
- Analysis of Cited Studies (Focus on Experimental Field Studies):
- Fardouly et al. (2015), Kleemans et al. (2018), Tiggemann & Anderberg (2020): These are described as experimental studies. They involve manipulating exposure to social media content (or similar) and measuring outcomes. These are generally lab or online experiments, not field studies according to the strict definition (real-world, consequential settings, objective measures like official statistics for the outcome of social comparison itself). The outcomes are typically self-reported comparison tendencies or body dissatisfaction.
- Experimental Field Studies Present? Unlikely, based on the descriptions. The experiments are more controlled (lab/online) settings.
- Expert Assessment of “Causal Field” Evidence: 11.46%. This percentage seems a bit high if the cited experimental studies are primarily lab/online. Experts might be interpreting “field” more broadly or considering other non-cited studies.
Claim 18. Social media increases perfectionism among adolescent girls. (Consensus statement: p. 101-102)
- Expert Assessment of Evidence Level (N=72):
- Correlational: 63.89%
- Causal limited: 27.78%
- Causal field: 9.72%
- Replicated causal field: 5.55%
- Wide-scale causal: 5.55%
- Analysis of Cited Studies (Focus on Experimental Field Studies):
- The consensus states: “experts provided no evidence from experimental studies directly manipulating social media exposure to measure its effect on perfectionism.”
- Vanhoffelen et al. (2025): Longitudinal study offered weak evidence. Not an experimental field study.
- Experimental Field Studies Present? No, as per the consensus statement.
- Expert Assessment of “Causal Field” Evidence: 9.72%. Given the lack of cited experimental (field or lab) evidence directly manipulating social media to measure perfectionism, this rating is likely based on broader interpretations or non-cited work.
Claim 19. Social media increases relational aggression among adolescent girls… (Consensus statement: p. 103-104)
- Expert Assessment of Evidence Level (N=71):
- Correlational: 69.01%
- Causal limited: 30.99%
- Causal field: 8.45%
- Replicated causal field: 1.41%
- Wide-scale causal: 4.23%
- Analysis of Cited Studies (Focus on Experimental Field Studies):
- The consensus states: “no experts provided experimental evidence establishing a causal link between social media use and increased relational aggression.”
- Martins & Weaver (2019) meta-analysis included experimental studies on “exposure to RA -> RA behaviors,” not specific social media use causing RA.
- Lapidot-Lefler et al. (2012) lab study on adults and general toxic behavior.
- Experimental Field Studies Present? No, for the direct claim.
- Expert Assessment of “Causal Field” Evidence: 8.45%. This is low and consistent with the lack of direct experimental field evidence.
Claim 20. Among adolescent girls, social media increases exposure to other people displaying or discussing their mental disorders. (Consensus statement: p. 105-106)
- Expert Assessment of Evidence Level (N=72):
- Correlational: 69.44%
- Causal limited: 27.78%
- Causal field: 12.50%
- Replicated causal field: 9.72%
- Wide-scale causal: 8.33%
- Analysis of Cited Studies (Focus on Experimental Field Studies):
- The consensus states: “no experts provided references for a direct causal link.”
- Studies cited are observational (Alho et al. on peer networks; phenomena on TikTok; Ernala et al. on Twitter disclosures; Hamilton et al. on self-harm content exposure; Ali et al. on Instagram DMs). These describe patterns of exposure or association, not experimental manipulation of social media use to measure changes in exposure levels in a field setting.
- Experimental Field Studies Present? No.
- Expert Assessment of “Causal Field” Evidence: 12.50%. This rating seems to suggest some experts believe such evidence exists or interpret observational field data more strongly in a causal direction for this specific type of “increase.”
Claim 21. Social media increases sexual predation and harassment of adolescent girls… (Consensus statement: p. 107-108)
- Expert Assessment of Evidence Level (N=59):
- Correlational: 52.54%
- Causal limited: 13.56%
- Causal field: 5.08%
- Replicated causal field: 3.39%
- Wide-scale causal: 11.86%
- Analysis of Cited Studies (Focus on Experimental Field Studies):
- The consensus states: “no experts provided experimental studies manipulating social media usage and measuring its effect on harassment or sexual predation.”
- Bhuller et al. (2013): Quasi-experimental on general internet access and sex crimes. This is a natural field experiment.
- Experimental Field Studies Present? Yes, Bhuller et al. (2013) for general internet, but not specifically social media mechanisms for predation/harassment.
- Expert Assessment of “Causal Field” Evidence: 5.08%. Very low, consistent with the lack of direct social media field experiments.
Claim 24. If most parents waited… it would benefit the mental health of adolescents overall. (Consensus statement: p. 112-113)
- Expert Assessment of Evidence Level (N=93):
- Correlational: 39.73%
- Causal limited: 26.03%
- Causal field: 8.22%
- Replicated causal field: 2.74%
- Wide-scale causal: 5.48%
- Analysis of Cited Studies (Focus on Experimental Field Studies):
- The consensus notes studies are correlational/longitudinal (Dempsey et al., 2020; Vaterlaus et al., 2021; Gerosa et al., 2024; Sapien Labs, 2024; Sun, 2023). None are experimental field studies manipulating delayed smartphone access for a cohort and measuring mental health. The discussion of “group-level effects” is hypothetical due to lack of data.
- Experimental Field Studies Present? No.
- Expert Assessment of “Causal Field” Evidence: 8.22%. Low, reflecting the absence of such studies.
Claim 25. Imposing (and enforcing) a legal minimum age of 16 for opening social media accounts would benefit the mental health of adolescents overall. (Consensus statement: p. 114-115)
- Expert Assessment of Evidence Level (N=68):
- Correlational: 26.47%
- Causal limited: 10.29%
- Causal field: 2.94%
- Replicated causal field: 2.94%
- Wide-scale causal: 5.88%
- Analysis of Cited Studies (Focus on Experimental Field Studies):
- Consensus: “Experts provided no references that were strictly related to the claim.” Ferguson (2024) and Thrul et al. (2025) are about effects of reducing use, not delaying initial access via legal age limits in a field experiment.
- Experimental Field Studies Present? No.
- Expert Assessment of “Causal Field” Evidence: 2.94%. Extremely low, reflecting the absence of such policy evaluation field experiments.
Claim 26. Phone-free schools would benefit the mental health of adolescents overall. (Consensus statement: p. 116-117)
- Expert Assessment of Evidence Level (N=77):
- Correlational: 44.16%
- Causal limited: 25.97%
- Causal field: 16.88%
- Replicated causal field: 2.60%
- Wide-scale causal: 11.69%
- Analysis of Cited Studies (Focus on Experimental Field Studies):
- Böttger & Zierer (2024) meta-analysis: Reviews 5 studies on school bans. These are natural experiments in field settings.
- King et al. (2024): Pre-registered study of a phone ban. This is a natural experiment in a field setting.
- Campbell et al. (2024) scoping review: Includes studies of school bans (natural field experiments).
- Goodyear et al. (2025) meta-analysis: Compares restrictive vs. permissive policies in schools (natural field experiments).
- Experimental Field Studies Present? Yes, studies of school phone bans are natural experiments in field settings.
- Expert Assessment of “Causal Field” Evidence: 16.88%. This is consistent with several (though limited in number and with mixed/methodologically critiqued results) natural field experiments being available.
Summary of Experimental Field Studies & Expert Assessment:
- Claim 8 (Sleep Deprivation): Potentially yes (intervention studies). 20.00% experts say Causal Field.
- Claim 9 (Sleep Dep & MH Decline): Potentially yes (interventions). 26.00% experts say Causal Field.
- Claim 10 (Attention Frag.): Potentially one (van Wezel et al.). 9.09% experts say Causal Field.
- Claim 11 (Attention Frag & MH Decline): Potentially yes (notification interventions). 7.41% experts say Causal Field.
- Claim 12 (Behavioral Addiction): Yes (Allcott et al.). 19.15% experts say Causal Field.
- Claim 13 (Behav. Add. & MH Decline): Unlikely direct experimental field study causing decline. 19.75% experts say Causal Field (likely broader interpretation).
- Claim 14 (Social Deprivation): Yes (Allcott et al., Geraci et al.). 14.46% experts say Causal Field.
- Claim 15 (Social Dep. & MH Decline): Yes (Killgore et al., interventions). 32.97% experts say Causal Field.
- Claim 17 (Visual Social Comp.): Unlikely from cited studies. 11.46% experts say Causal Field.
- Claim 18 (Perfectionism): No from cited studies. 9.72% experts say Causal Field.
- Claim 19 (Relational Aggression): No direct from cited studies. 8.45% experts say Causal Field.
- Claim 20 (Exposure to Disorders): No from cited studies. 12.50% experts say Causal Field.
- Claim 21 (Sexual Predation): Yes, for general internet (Bhuller et al.). 5.08% experts say Causal Field.
- Claim 24 (Delay Smartphones & MH): No from cited studies. 8.22% experts say Causal Field.
- Claim 25 (Age Limits & MH): No from cited studies. 2.94% experts say Causal Field.
- Claim 26 (Phone-Free Schools & MH): Yes (school ban studies). 16.88% experts say Causal Field.
It seems for many claims directly testing “Social media increases X” with experimental field studies, the evidence is sparse or non-existent according to the cited literature. The expert ratings for “Causal Field” are generally low for these, but sometimes higher than what the cited experimental field literature in the consensus statements would directly support, suggesting experts might be drawing on a wider body of knowledge or interpreting observational field data more causally. For policy-related claims, the expert assessment of causal field evidence is also very low, reflecting the difficulty and rarity of such large-scale experimental policy evaluations.
Request for summary table
First draft:
So, experts could endorse multiple levels of evidence (despite some of them being mutually exclusive). Do the authors describe how they recoded these responses to make a graph where categories sum to 100%? Highest/lowest level of evidence endorsed?
Second draft:
Earlier I asked for experimental field studies, but causal evidence could also come from natural experiments or well-controlled designs. Please revise the table and assess whether an experiment or natural experiment in the field supporting the claim is reported. If the text in Section S5 makes the case that an observational study can be used to back up the causal claim, mention it.
Also, don’t sum up the reported percentages for causal field and above since they are not mutually exclusive. Instead, try to read it off the plot and also report the percentage who say causal field in another column.