INDEX
Explanations
pronouns and references to individuals' roles and achievements
New Auto-Interp
Negative Logits
ä¸Ī
-0.15
tered
-0.14
أبÙĬ
-0.13
Roe
-0.13
MMC
-0.13
.aspx
-0.13
stav
-0.13
rapper
-0.13
jer
-0.13
tparam
-0.13
POSITIVE LOGITS
also
0.18
also
0.17
recent
0.16
juga
0.16
ÏģÏİ
0.15
oldem
0.15
recently
0.15
rovnÄĽÅ¾
0.15
ÙĩÙħÚĨÙĨÛĮÙĨ
0.15
bows
0.14
Activations Density 0.067%