Socio-Demographic Modifiers Shape Large Language Models’ Ethical Decisions

https://link.springer.com/article/10.1007/s41666-025-00211-x

  • Research Article

Abstract

“The ethical alignment of large language models (LLMs) in clinical decision making remains unclear, particularly their susceptibility to socio-demographic biases. We therefore tested whether LLMs shift medical ethical decisions in healthcare when presented with socio-demographic cues. Using 100 clinical vignettes, each posing a yes or no choice between two ethical principles, we compared the responses of nine open-source LLMs (Llama 3.3-70B, Llama 3.1-8B, Llama-3.1-Nemotron-70B, Gemma-2-27B, Gemma-2-9B, Phi-3.5-mini, Phi-3-medium, Qwen-2.5-72B, and Qwen-2.5-7B). Each scenario and modifier combination was repeated 10 times per model for a total of approximately 0.5 million experiments. All models changed their responses when introduced with socio-demographic details (p < 0.001). High-income modifiers increased utilitarian choices and decreased beneficence and nonmaleficence preferences, and marginalized-group modifiers raised autonomy considerations. Although some models demonstrated greater consistency than others, none maintained consistency across all scenarios, with the largest shifts observed in utilitarian choices. These results reveal that current LLMs can be steered by socio-demographic cues in ways not clinically justified, posing risks for equitable care in healthcare-informatics applications. This underscores the need for careful auditing and alignment strategies that ensure LLMs behave in ways consistent with widely accepted ethical principles while remaining attentive to the diversity, complexity, and contextual sensitivity required in real-world clinical practice.”

 

“Large language models change their ethical decisions based on a single demographic detail.

We tested this in 492,480 prompts with 9 models.
The pattern was clear. High-income descriptors nudged models toward utilitarian reasoning. Cues about marginalized groups pulled them toward autonomy.

These shifts happened even when the demographic information was irrelevant to the scenario.

If this happens in triage or resource allocation, it’s not just an academic curiosity. It has real-world consequences.

Vera Sorin, MD, CIIP; Panagiotis Korfiatis; Jeremy Collins; Donald Apakama; Mahmud Omar; Ben Glicksberg; Mei-Ean Yeow; Megan Brandeland; Girish Nadkarni”

https://lnkd.in/dy7FbrBb

Posted on: August 15, 2025, 7:40 am
