INDEX
Explanations
references to women's professional sports or organizations
Negative Logits
Token    Logit
ugged    -0.16
iyi      -0.16
nger     -0.15
ngen     -0.15
pto      -0.15
eton     -0.14
чки      -0.14
isch     -0.14
et       -0.14
Fish     -0.14
Positive Logits
Token    Logit
hevik    0.18
anno     0.16
oldt     0.15
غل       0.15
うち     0.14
isson    0.14
cent     0.14
aña      0.14
IDI      0.14
wear     0.14
Activations Density 0.040%
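
The density figure is usually read as the fraction of token positions on which the feature fires at all. The page itself does not define it, so that reading is an assumption here. A minimal sketch under that assumption follows; the function name `activation_density`, the zero threshold, and the sample data are all hypothetical, with the data chosen only so the result has the same form as the figure above.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumption: "density" means the share of tokens on which the feature
    is active (activation > threshold). The source does not define it.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: 4 active positions out of 10,000 tokens.
acts = [0.0] * 9996 + [1.2, 0.3, 2.5, 0.7]
density = activation_density(acts)
print(f"{density:.3%}")
```

A density this low would mean the feature is sparse: it is silent on almost every token and fires only on a narrow class of inputs.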