INDEX

Explanations

gender stereotypes

discussions of gender roles and stereotypes, especially societal expectations distinguishing boys and girls or men and women.

New Auto-Interp

Configuration

Dataset (Dashboard)

Various

Embeds

IFrame

Link

Not in Any Lists

Negative Logits

_missing

-0.08

 substring

-0.08

 procesa

-0.08

-0.07

 основания

-0.07

artz

-0.07

黎

-0.07

wall

-0.07

ighbors

-0.07

missing

-0.07

POSITIVE LOGITS

 stereotyp

0.10

.RO

0.09

 swapped

0.09

 റോ

0.09

 heterosexual

0.09

 roteiro

0.09

 hybr

0.08

 societal

0.08

 perceived

0.08

hed

0.08

Activations Density 0.019%