Logo image
New search Researchers Research units
Sign in
Adversarial Contrastive Decoding: Boosting Safety Alignment of Large Language Models via Opposite Prompt Optimization
Preprint   Open access

Adversarial Contrastive Decoding: Boosting Safety Alignment of Large Language Models via Opposite Prompt Optimization

Zhengyue Zhao, Xiaoyun Zhang, Kaidi Xu, Xing Hu, Rui Zhang, Zidong Du, Qi Guo and Yunji Chen
arXiv.org
24 Jun 2024
url
https://arxiv.org/abs/2406.16743View
Preprint (Author's original)arXiv.org - Non-exclusive license to distribute Open

Abstract

Computer Science - Computation and Language

Metrics

13 Record Views

Details

Logo image