INDEX
Negative Logits
ros
-0.07
PROP
-0.06
YC
-0.06
.hwp
-0.06
slun
-0.06
Sox
-0.06
капит
-0.06
StatusCode
-0.06
Skate
-0.06
ฟ
-0.06
POSITIVE LOGITS
Mixing
0.07
Without
0.07
~
0.07
.factor
0.07
without
0.06
도
0.06
education
0.06
)*
0.06
odia
0.06
[.
0.06
Activations Density 0.007%