INDEX
Explanations
mentions of individuals or groups of people in various contexts
New Auto-Interp
Negative Logits
clude
-0.14
ning
-0.14
öz
-0.14
something
-0.14
isode
-0.14
åħ¶ä¸Ń
-0.13
agus
-0.13
makt
-0.13
ssf
-0.13
êtes
-0.13
POSITIVE LOGITS
who
0.31
everywhere
0.30
who
0.25
worldwide
0.21
across
0.21
Who
0.18
Everywhere
0.18
Who
0.18
whose
0.17
ino
0.16
Activations Density 0.296%