INDEX
Explanations
references to individuals or groups of people
Negative Logits
(es        -0.21
stadt      -0.18
ship       -0.16
berg       -0.16
율         -0.16
U+FE0F     -0.16
asio       -0.15
はない     -0.15
wner       -0.14
appName    -0.14
Positive Logits
who        0.35
/entities  0.26
whom       0.26
who        0.25
/groups    0.24
Who        0.22
whose      0.21
الذين      0.21
hood       0.19
Who        0.19
Activations Density 0.121%