INDEX
Explanations
words indicating possession or actions taken by a subject
Negative Logits
Mayor     -0.16
elay      -0.16
ught      -0.15
Byrne     -0.14
па        -0.14
Mayor     -0.13
aspers    -0.13
-wide     -0.13
usch      -0.13
Rum       -0.13
Positive Logits
/ion      0.16
è°·       0.15
wcs       0.15
iple      0.15
èĪĮ       0.14
IGO       0.14
γÏĮ       0.14
è¦        0.13
Orch      0.13
illac     0.13
Activations Density 0.183%