INDEX
Negative Logits
emp
-0.07
baseline
-0.06
TextWriter
-0.06
habitats
-0.06
amm
-0.06
Scale
-0.06
-ves
-0.06
zend
-0.06
proc
-0.06
quis
-0.06
POSITIVE LOGITS
prene
0.07
Москва
0.07
scoring
0.06
UPPORT
0.06
تاریخ
0.06
(/^\
0.06
σφ
0.06
Profession
0.06
(commit
0.06
intercepted
0.06
Activations Density 0.000%