Negative Logits
Token            Logit
D                -0.43
LOU              -0.42
K                -0.41
are              -0.40
a                -0.40
L                -0.40
emes             -0.40
M                -0.38
Arg              -0.38
Le               -0.38

Positive Logits
Token            Logit
Administrativna  0.60
ς                0.59
Italijani        0.56
ItemBackground   0.56
ModelExpression  0.54
iddhar           0.54
cipar            0.54
клопе            0.53
WriteBarrier     0.53
Vitale           0.53

Activations Density: 0.223%