INDEX
Explanations
references to gendered pronouns
New Auto-Interp
Negative Logits
Biôgrafia
-0.82
PreferredItem
-0.81
UnsafeEnabled
-0.74
Roskov
-0.71
StoryboardSegue
-0.69
httphttps
-0.66
Hentet
-0.65
EndInit
-0.65
gameserver
-0.65
PreExecute
-0.63
POSITIVE LOGITS
חיצוניים
0.61
she
0.57
she
0.56
zij
0.54
$__
0.51
dumne
0.51
himself
0.51
hers
0.51
She
0.50
ticoli
0.49
Activations Density 0.216%