<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <issn pub-type="ppub">1613-0073</issn>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>Genders in Large Language Models - Abstract</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Lucille Njoo</string-name>
          <email>lnjoo@cs.washington.edu</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Lee Janzen-Morel</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Inna Wanyin Lin</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Yulia Tsvetkov</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Paul G. Allen School of Computer Science, University of Washington</institution>
          ,
          <addr-line>185 E Stevens Way NE, Seattle, WA 98195.</addr-line>
          <country country="US">United States</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2024</year>
      </pub-date>
      <abstract>
        <p>Mental health stigma manifests differently for different genders, often being more associated with women and overlooked with men. Prior work in NLP has shown that gendered mental health stigmas are captured in large language models (LLMs). However, in the last year, LLMs have changed drastically: newer, generative models not only require different methods for measuring bias, but they also have become widely popular in society, interacting with millions of users and increasing the stakes of perpetuating gendered mental health stereotypes. In this paper, we examine gendered mental health stigma in GPT3.5-Turbo, the model that powers OpenAI's popular ChatGPT. Building off of prior work, we conduct both quantitative and qualitative analyses to measure GPT3.5-Turbo's bias between binary genders, as well as to explore its behavior around non-binary genders, in conversations about mental health. We find that, though GPT3.5-Turbo refrains from explicitly assuming gender, it still contains implicit gender biases when asked to complete sentences about mental health, consistently preferring female names over male names. Additionally, though GPT3.5-Turbo shows awareness of the nuances of non-binary people's experiences, it often over-fixates on non-binary gender identities in free-response prompts. Our preliminary results demonstrate that while modern generative LLMs contain safeguards against blatant gender biases and have progressed in their inclusiveness of non-binary identities, they still implicitly encode gendered mental health stigma, and thus risk perpetuating harmful stereotypes in mental health contexts.</p>
      </abstract>
      <kwd-group>
        <kwd>Machine Learning for Cognitive and Mental Health Workshop</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Keywords</title>
      <p>NLP, large language models, bias, fairness, gender, mental health, stigma, intersectionality</p>
    </sec>
  </body>
  <back>
    <ref-list />
  </back>
</article>