INDEX
Negative Logits
evidenced
-0.07
-0.07
jenje
-0.07
victory
-0.07
enemies
-0.07
evidence
-0.06
Um
-0.06
كلا
-0.06
enemy
-0.06
Dois
-0.06
POSITIVE LOGITS
reka
0.09
=t
0.08
.tight
0.08
_subject
0.08
hingegen
0.08
tese
0.08
пән
0.08
stan
0.08
vaka
0.08
букмекер
0.08
Activations Density 0.186%