INDEX
Explanations
references to female characters in media, particularly in relation to their roles and relationships
New Auto-Interp
Negative Logits
handsome
-0.20
-0.18
him
-0.17
Romeo
-0.16
Eric
-0.15
guy
-0.15
Andrew
-0.15
Patrick
-0.15
Mario
-0.15
Eddie
-0.15
POSITIVE LOGITS
Gill
0.19
Rarity
0.19
porcelain
0.17
Tracy
0.17
Meredith
0.17
Fel
0.17
herself
0.17
Со
0.16
Meg
0.16
actresses
0.16
Activations Density 0.144%