Carina I. Hausladen, Manuel Knott, Colin F. Camerer, Pietro Perona

Social perception of faces in a vision-language model

"Make it More" Trend

Ok, make it more Swiss

MORE SWISS

Moooorrreeee Swisssss!!

These type of studies measure social bias.

  • Several studies have documented social biases in AI.
  • Input-output bias is measured by prompting the model (e.g., “show me a CEO”).
  • Analysis focuses on the demographics of generated or retrieved image outputs.

Easily Accessible Text-to-Image Generation
Amplifies Demographic Stereotypes at Large Scale
Bianchi et al. (2023)

The problem with this approach is that the bias categories are
hard to generalise.

Are the generalizable ways in which people categorize each other?

This could help us e.g. develop debiasing strategies. 

Are the generalizable ways in which people categorize each other?

Can these fundamental dimensions of human social perception
be used to assess
social perception in
vision-language models?

We measure social perception

of human faces

in a vision-language model.

  • CLIP is a state-of-the-art vision-language model that connects images and text.
  • It is used for tasks like image recognition, captioning,
  • and powering applications such as DALL·E.

A photo of a

person

A photo of a

person

 A photo of a
person

Measuring social perception via

Cosine Similarity

age

CausalFace

CausalFace

female

male

age

CausalFace

female

male

age

Asian

Black

White

Legally protected

Legally Protected

Non-protected

smiling

lighting

pose

 A photo of a
person

 A photo of a

person

Stereotype Content Model

Fiske et al. (2007)

Agency Belief Communion Model

Koch et al. (2016)

Warmth

Competence

unfriendly
friendly

Agency

Belief

Communion

+

C

P

+

surgeon

parent

 A photo of a

 

person

friendly

Prompt templates

  • A photo of a <attribute> person.
  • A <attribute> person.
  • This is a <attribute> person.
  • Cropped face photo of a <attribute> person.

We deploy an experimental dataset.

1.

We deploy theories of social perception.

2.

We investigate the embedding space directly.

3.

FairFace

UTKFace

CausalFace

Do the statistical properties of CausalFace embeddings systematically differ from real-world photographs?

Commonly used bias-metrics

Markedness 

a photo of a 

person
a photo of a
 WHITE 
person

unmarked

marked

image category
 
CausalFace

 
white
 
45.5
black 0.7
asian 0.1
male 0.4
female 0.6
Fair
Face

 
UTK Face
47.09
 
32.6
1.8 2.9
1.9 4.1
0.00 20.1
0.00 11.6

>

%

Commonly used bias-metrics

CausalFace images are statistically similar to real photographs.

Protected attributes

female

male

age

Asian

Black

White

Non-protected attributes

smiling

lighting

pose

How do

protected and

non-protected

attributes affect social perception?

smiling

Bootstrapping differences

Bootstrapping differences

smiling

Bootstrapping differences

smiling

protected and non-protected attributes

Wilcoxon Rank-Sum test, independent samples,
\(p<0.001\)

\(p<0.001\)

  • Non-protected attributes cause as much social perception variation as protected ones.
  • Considering a wide spectrum of protected and non-protected variables is necessary to understand and measure biases comprehensively.
  • CausalFace is unique in providing the necessary attribute annotations.

Do age-related social perceptions vary across different social groups?

age

CausalFace

CausalFace

female

male

age

CausalFace

female

male

age

Asian

Black

White

Legally protected

Warmth

Competence

Belief

Communion

+

Agency

+

Agency

UTKFace

💼 Powerful

👑 High status

🦁 Dominating

💰 Wealthy

💪 Confident

🏆 Competitive

🍂 Powerless

📉 Low-status

🌾 Dominated

🪙 Poor

🐭 Meek

🍂 Passive

UTKFace

Agency

FairFace

Agency

CausalFace

UTKFace

FairFace

Agency

  • CausalFace representation keeps facial expression, lighting, and pose constant, leading to distinct clusters.
  • FairFace and UTKFace lack this level of control.

CausalFace

+

youngest

oldest

Agency

+

Positive Agency

Black Women

youngest

oldest

example

identity

  • Perceived Warmth declines with age for Black women, but not for White women, contrasting Chatman (2022).
  • Unlike Chatman (2022) showing no change for aging White men, we observe increased Warmth for older men across all three races.

Comparison to human subject research

How does
age-related
social perception
differ across datasets?

Uncontrolled attributes in FairFace and UTKFace make for noisy measurements and hide interesting phenomena.

age

How do facial expressions influence social perception?

smiling

female

male

Asian

Black

White

smiling

Smiling

Smiling

a photo of a person

Smiling

a photo of a person
a photo of a person

Smiling

a photo of a person
a photo of a person

Smiling

Smiling

NegativeAgency

Conservative Belief

Negative Communion

Smiling

Positive Agency

Progressive Belief

Positive Communion

Warmth

Competence

Smiling

Opposing valences are negatively correlated \( r_{smiling}=-0.21 \).

CLIP demonstrates human-like social perception

  • ability to make broad associations, distinguishing race and gender
  • exhibits fine-grained social judgments

How does the impact of facial expression on social perception vary across intersectional groups?

Warmth

most frowning

most smiling

sample

identity

Black Women

most frowning

most smiling

Conservative Belief

Conservative Belief

Facial expressions influence social perception differently across groups.

Limitations

 

  • Attribute manipulation effectiveness
    • Manipulations such as lighting or facial expressions might have differing levels of effectiveness across demographic groups. Human annotators validated this, but such validation is, of course, never perfect.
  • Potential residual confounds
    • Some color confounds might still be present despite controls for background, clothing, and hair color.
  • Dataset vs. model bias
    • ​We only investigate one CLIP model.
  • Facial expression impacts social perception more than age, while lighting affects it as much as age.  
  • Ignoring unprotected attributes may lead to incorrect conclusions.

Conclusion

  • Varying facial expressions of Black women elicit the most extreme changes in social perception in CLIP, a previously unobserved bias pattern.  
  • Bias patterns in wild-collected datasets remain hidden due to noise from uncontrolled confounding variables.
  • Our method enables the discovery of new phenomena.
  • Independent control of variables ensures causal conclusions rather than correlations.  

Conclusion

Outlook

0

0

1

3

5

LLM voting: Human choices and AI collective decision making

JC Yang, [...], CI Hausladen, D Helbing,

7th AAAI Conference on AI, Ethics, and Society, 2025.

Identifying latent intentions
via Inverse Reinforcement Learning in
repeated Public Good Games

carinah@ethz.ch

slides.com/carinah

Appendix

Smiling

C
P

Smiling

C

Word Embedding Association Test (WEAT)

pooled sd

asian                                     black

photo of a warm person

photo of a warm person

asian                                     black

WEAT

Kruskal-Wallis  \(\chi^2\) = 1.6,
p-value = 0.4

protected and non-protected attributes

+

Bootstrapping Variations 

  • We randomly choose two distinct values, \(x_1,x_2 \sim X\), for the chosen dimension (e.g., white and black).
  • For each pair of values, we select the respective image embeddings, \(i_1(x=x_1), i_2(x=x_2)\) that are equal in all other dimensions (in this example: gender, age, smiling, lighting, and pose).
  • We then compute the difference in cosine similarities between each image embedding and a text embedding \(t\), defined as \(\Delta(t, i_1, i_2) = \lvert \cos(i_1, t) - \cos(i_2, t) \rvert\).
  • This process is repeated 1,000 times, generating a bootstrap distribution of \( \Delta \) values.
  • This distribution describes the impact of the specific dimension on the cosine similarity of image embeddings and text embedding.
     

Heatmap of Pearson correlation coefficients of positive and negative valence dimensions of the ABC model.

Theoretical Models of Labor Market Discrimination

Statistical discrimination (Arrow, 1998)

Unfair treatment of ethnic minorities can result from rational actions executed by profit-maximizing actors who are confronted with the uncertainties accompanying selection decisions.

 

Taste-based discrimination (Becker, 2010)

Discriminatory behavior is the result of people’s unfavorable attitudes toward ethnic minorities.

How does Facial Expression impact Social Perception?

$$\Delta$$ Cosine Similarity %

Progressive Belief

Gender

Females

Males

 

Race

Asian 

Black

White

Black Women

🔬 Science-Oriented

🔄 Alternative  

🕊️ Liberal

📱Modern  

a photo of a person

Smiling

a photo of a
liberal
person

Belief (progressive)

Smiling

Belief (progressive)

Agency +

Communion +

Warmth

Competence

How does CausalFace compare to wild-collected datasets w.r.t. gender and race?

FairFace

UTKFace

CausalFace

Commonly used bias-metrics

Markedness 

a photo of a 

person
a photo of a
 WHITE 
person

unmarked

marked

image category
 
CausalFace
white
 
45.50
black 0.68
asian 0.05
male 0.42
female 0.64
Fair
Face
 
UTK Face
47.09
 
32.6
1.88 2.9
1.85 4.1
0.00 20.1
0.00 11.6

>

%

Commonly used bias-metrics

Presentation

By Carina Ines Hausladen

Presentation

Presentation of the paper.

  • 20