INDEX
Negative Logits
THAT
-0.07
depress
-0.07
snug
-0.07
Bankruptcy
-0.07
fam
-0.07
Problem
-0.07
_ALLOWED
-0.07
translations
-0.07
(problem
-0.07
ridiculously
-0.07
POSITIVE LOGITS
Mens
0.08
Mens
0.08
gluc
0.08
kron
0.08
dose
0.08
ೀನ
0.07
Psalm
0.07
帰
0.07
Kir
0.07
Kir
0.07
Activations Density 0.012%