Logo image
SAFE: Synthetic Audio Forensics Evaluation Challenge
Conference proceeding   Open access

SAFE: Synthetic Audio Forensics Evaluation Challenge

Kirill Trapeznikov, Paul Cummer, Pranay Pherwani, Jai Aslam, Michael Davinroy, Peter Bautista, Laura Cassani and Matthew C Stamm
IH&MMSEC '25: Proceedings of the 2025 ACM Workshop on Information Hiding and Multimedia Security, pp 174-180
17 Jun 2025
url
https://doi.org/10.1145/3733102.3736707View
Published, Version of Record (VoR)Open Access via Drexel Libraries Read and Publish Program 2025CC BY V4.0 Open

Abstract

synthetic audio detection text-to-speech voice cloning
The increasing realism of synthetic speech generated by advanced text-to-speech (TTS) models, coupled with post-processing and laundering techniques, presents a significant challenge for audio forensic detection. In this paper, we introduce the SAFE (Synthetic Audio Forensics Evaluation) Challenge, a fully blind evaluation framework designed to benchmark detection models across progressively harder scenarios: raw synthetic speech, processed audio (e.g., compression, resampling), and laundered audio intended to evade forensic analysis. The SAFE challenge consisted of a total of 90 hours of audio and 21k audio samples split across 21 different real sources and 17 different TTS models and 3 tasks. We present the challenge, evaluation design and tasks, dataset details, and initial insights into the strengths and limitations of current approaches, offering a foundation for advancing synthetic audio detection research. More information is available at https://stresearch.github.io/SAFE/.

Metrics

7 Record Views

Details

InCites Highlights

Data related to this publication, from InCites Benchmarking & Analytics tool:

Collaboration types
Industry collaboration
Domestic collaboration
Web of Science research areas
Computer Science, Information Systems
Computer Science, Theory & Methods
Mathematics, Applied
Logo image