INDEX
Negative Logits
[]{-0.08
Xu
-0.07
modifying
-0.07
EC
-0.07
controlling
-0.07
સ
-0.07
Ens
-0.07
monot
-0.07
stata
-0.07
Dy
-0.07
POSITIVE LOGITS
hostility
0.09
dives
0.09
angu
0.08
violence
0.08
ób
0.08
warfare
0.08
verdeeld
0.08
যুদ্ধ
0.08
slashes
0.08
Angels
0.08
Activations Density 0.003%