2024
DOI: 10.31219/osf.io/wvxy2
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

More Distinctively Black and Feminine Faces Lead to Increased Stereotyping in Vision-Language Models

Messi Ho Jun Lee,
Jacob Montgomery,
Calvin K. Lai

Abstract: Vision Language Models (VLMs), exemplified by GPT-4V, adeptly integrate text and vision modalities. This integration enhances Large Language Models' ability to mimic human perception, allowing them to process image inputs. Despite VLMs' advanced capabilities, however, there is a concern that VLMs inherit biases of both modalities in ways that make biases more pervasive and difficult to mitigate. Our study explores how VLMs perpetuate homogeneity bias and trait associations with regards to race and gender. When… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
0
0

Publication Types

Select...

Relationship

0
0

Authors

Journals

citations
Cited by 0 publications
references
References 43 publications
(48 reference statements)
0
0
0
Order By: Relevance

No citations

Set email alert for when this publication receives citations?