Logo image
Enhancing Patient-Physician Communication: Simulating African American Vernacular English in Medical Diagnostics with Large Language Models
Journal article   Open access

Enhancing Patient-Physician Communication: Simulating African American Vernacular English in Medical Diagnostics with Large Language Models

Yeawon Lee, Chia-Hsuan Chang and Christopher C Yang
Journal of healthcare informatics research
11 Mar 2025
url
https://doi.org/10.1007/s41666-025-00194-9View
Published, Version of Record (VoR)Open Access via Drexel Libraries Read and Publish Program 2025CC BY V4.0 Open

Abstract

Large Language Model Health Disparities Communications
Effective communication is crucial in reducing health disparities. However, linguistic differences, such as African American Vernacular English (AAVE), can lead to communication gaps between patients and physicians, negatively affecting care and outcomes. This study examines whether large language models (LLMs), specifically GPT-4 and Llama 3.3, can replicate AAVE in simulated clinical dialogues to improve cultural sensitivity. We tested four prompt types—BaseP, DemoP, LingP, and CompP—using United States Medical Licensing Examination (USMLE) case simulations. Statistical analyses on the models’ outputs showed a significant difference among prompt types for both GPT-4 (F(2,70) = 6.218, p = 0.003) and Llama 3.3 (F(2,70) = 12.124, p < 0.001), indicating that including demographic information and/or explicit AAVE cues influences each model’s output. Combining demographic and linguistic cues (CompP) yielded the highest mean AAVE feature counts (e.g., 9.83 for GPT-4 vs. 16.06 for Llama 3.3), although neither model fully captured the diversity of AAVE. Moreover, simply mentioning African American demographics triggers extra informal forms, suggesting built-in stereotypes or biases in both models. Overall, these findings highlight the promise of LLMs for culturally sensitive healthcare communication, while underscoring the need for continued refinement to address stereotypes and more accurately represent diverse linguistic styles.

Metrics

Details

InCites Highlights

Data related to this publication, from InCites Benchmarking & Analytics tool:

Collaboration types
Domestic collaboration
Web of Science research areas
Computer Science, Information Systems
Health Care Sciences & Services
Medical Informatics
Logo image