INDEX
Negative Logits
(Index
-0.07
Fail
-0.06
Coul
-0.06
_FILL
-0.06
نمود
-0.06
.receiver
-0.06
Plant
-0.06
Prefix
-0.06
zel
-0.06
699
-0.06
POSITIVE LOGITS
accepted
0.07
anny
0.07
internship
0.07
mlad
0.07
recru
0.07
civ
0.07
терн
0.07
(an
0.07
dumping
0.07
Dayton
0.07
Activations Density 0.005%