Conference proceeding
Transfer Attack for Bad and Good: Explain and Boost Adversarial Transferability across Multimodal Large Language Models
Proceedings of the 33rd ACM International Conference on Multimedia, pp 5010-5019
27 Oct 2025
Abstract
Multimodal Large Language Models (MLLMs) demonstrate exceptional performance in cross-modality interaction, yet they also suffer adversarial vulnerabilities. In particular, the transferability of adversarial examples remains an ongoing challenge. In this paper, we specifically analyze the manifestation of adversarial transferability among MLLMs and identify the key factors that influence this characteristic. We discover that the transferability of MLLMs exists in cross-LLM scenarios with the same vision encoder and indicate two key Factors that may influence transferability. We provide two semantic-level data augmentation methods, Adding Image Patch (AIP) and Typography Augment Transferability Method (TATM), which boost the transferability of adversarial examples across MLLMs. To explore the potential impact in the real world, we utilize two tasks that can have both negative and positive societal impacts: 1. Harmful Content Insertion and 2. Information Protection.
Metrics
6 Record Views
Details
- Title
- Transfer Attack for Bad and Good: Explain and Boost Adversarial Transferability across Multimodal Large Language Models
- Creators
- Hao Cheng - University of Hong KongErjia Xiao - Hong Kong University of Science and TechnologyJiayan Yang - Chinese University of Hong Kong, ShenzhenJinhao Duan - University of North Carolina at Chapel HillYichi Wang - Beijing University of TechnologyJiahang Cao - Hong Kong University of Science and TechnologyQiang Zhang - Hong Kong University of Science and TechnologyLe Yang - Xi'an Jiaotong UniversityKaidi Xu - Drexel UniversityJindong Gu - University of OxfordRenjing Xu - Hong Kong University of Science and Technology
- Publication Details
- Proceedings of the 33rd ACM International Conference on Multimedia, pp 5010-5019
- Conference
- MM '25: The 33rd ACM International Conference on Multimedia
- Series
- ACM Conferences
- Publisher
- Association for Computing Machinery
- Number of pages
- 10
- Resource Type
- Conference proceeding
- Language
- English
- Academic Unit
- Computer Science
- Web of Science ID
- WOS:001656760500519
- Scopus ID
- 2-s2.0-105024066642
- Other Identifier
- 991022127751704721