INDEX
NEGATIVE LOGITS
Token            Logit
itſelf           -1.23
yntaxException   -1.19
myſelf           -1.18
feroit           -1.12
Мексичка         -1.09
avoient          -1.08
ſelves           -1.06
Theſe            -1.06
новниш           -1.05
auroit           -1.05
POSITIVE LOGITS
Token            Logit
ally             0.71
or               0.56
for              0.55
,                0.55
re               0.54
and              0.52
red              0.52
act              0.52
/                0.51
un               0.50
Activations Density 0.053%
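
As a rough illustration of what a density figure like the one above usually means, the sketch below computes the fraction of token positions at which a feature's activation is above a threshold. This is an assumption about how the statistic is defined, not something the dashboard states. The function name `activation_density`, the `threshold` parameter, and the toy data are hypothetical and chosen only so the arithmetic is easy to follow.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumed definition: density = (# active positions) / (# positions).
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical toy data, not taken from the dashboard:
# 100,000 token positions, of which 53 have a nonzero activation.
toy_activations = [0.0] * 99_947 + [1.0] * 53

density = activation_density(toy_activations)
print(f"Activations Density {density:.3%}")  # prints "Activations Density 0.053%"
```

Under this reading, a density of 0.053% would mean the feature is active on roughly 1 in every 1,900 token positions.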