Journal article
Accuracy of Autism-Related TikTok Information in Italian: A Comparison Between Human Raters and Large Language Models
Journal of autism and developmental disorders
18 Feb 2026
PMID: 41706307
Featured in Collection : UN Sustainable Development Goals @ Drexel
Abstract
Purpose Social networking sites are major channels for sharing information on neurodiversity, including autism spectrum disorder. TikTok has become a particularly influential platform for autism-related communication, yet concerns remain about the scientific accuracy of such content. Most prior studies have focused on English-language videos and have evaluated accuracy with limited granularity. Additionally, the difficulty of achieving consistent expert ratings underscores the need for automated reliability assessment.
Methods In this study, we examined 408 informational statements extracted from 148 TikTok videos posted under the hashtag #Autismo (Italian for #Autism). Three clinical experts independently classified each statement as inaccurate, overgeneralized, or accurate; their median ratings served as the human-derived ground truth and were compared with classifications from two large language models: ChatGPT 4.0 mini and Gemini 1.5 Flash.
Results Human raters showed moderate agreement (κmean = 0.52) and high specific agreement only for accurate statements, with lower agreement for overgeneralized and inaccurate content. ChatGPT achieved moderate agreement with human ratings (κ = 0.58), while Gemini reached only fair agreement (κ = 0.29). ChatGPT also exhibited a more conservative evaluation pattern (accurate information: precision = 0.89, recall = 0.82), whereas Gemini tended to overestimate accuracy (accurate information: precision = 0.76, recall = 0.93).
Conclusion These findings suggest that LLMs, particularly ChatGPT, may support cautious and assistive evaluation of online health content. Future research should assess their applicability across online communities and platforms and explore their integration into accuracy-based alert systems that provide users with contextual reliability cues.
Metrics
3 Record Views
Details
- Title
- Accuracy of Autism-Related TikTok Information in Italian: A Comparison Between Human Raters and Large Language Models
- Creators
- Alessandro Carollo - University of TrentoSeraphina Fong - University of TrentoGiovanni Belardinelli - University of TrentoSilvia Perzolli - University of TrentoGiacomo Vivanti - Drexel University, Psychological and Brain Sciences (Psychology)Daniel Messinger - University of MiamiDagmara Dimitriou - University College LondonGianluca Esposito - University of Trento
- Publication Details
- Journal of autism and developmental disorders
- Publisher
- SPRINGER/PLENUM PUBLISHERS
- Number of pages
- 9
- Grant note
- Universit degli Studi di Trento
Open access funding provided by Universita degli Studi di Trento within the CRUI-CARE Agreement.
- Resource Type
- Journal article
- Language
- English
- Academic Unit
- Psychological and Brain Sciences (Psychology); A.J. Drexel Autism Institute
- Web of Science ID
- WOS:001694604100001
- Scopus ID
- 2-s2.0-105030513460
- Other Identifier
- 991022161041504721
UN Sustainable Development Goals (SDGs)
This publication has contributed to the advancement of the following goals:
Source: SDGs in the Output
InCites Highlights
Data related to this publication, from InCites Benchmarking & Analytics tool:
- Collaboration types
- Domestic collaboration
- International collaboration
- Web of Science research areas
- Psychology, Developmental
Related media
Research
Autismo su TikTok: la ricerca dell’Università di Trento tra accuratezza e semplificazioni
La Voce Del Trentino (Redazione Trento)