INDEX
Explanations
references to female representation and gender dynamics in video games
New Auto-Interp
Negative Logits
Claw
-0.16
xes
-0.15
Snape
-0.15
_OS
-0.15
614
-0.15
757
-0.15
idian
-0.15
681
-0.15
636
-0.15
äge
-0.14
POSITIVE LOGITS
Naughty
0.38
Ellie
0.31
Joel
0.29
Na
0.25
Na
0.23
Ell
0.23
Ell
0.22
ell
0.22
naughty
0.22
ellipt
0.21
Activations Density 0.030%