INDEX
Explanations
mentions of individuals and organizations
New Auto-Interp
Negative Logits
ess
-0.17
ANGO
-0.15
UGIN
-0.15
pearance
-0.15
YST
-0.14
گاÙĩ
-0.14
emek
-0.14
wort
-0.13
iol
-0.13
anian
-0.13
POSITIVE LOGITS
who
0.24
/groups
0.24
/entities
0.23
whom
0.20
who
0.19
/entity
0.19
ized
0.17
itarian
0.17
ified
0.16
Who
0.16
Activations Density 0.037%