INDEX
Negative Logits
_classification
-0.07
-induced
-0.07
Permit
-0.07
skim
-0.07
[channel
-0.06
Scandinavian
-0.06
fizz
-0.06
emachine
-0.06
Sakura
-0.06
ste
-0.06
POSITIVE LOGITS
.sql
0.07
терап
0.06
_ALREADY
0.06
mücadele
0.06
наш
0.06
účast
0.06
>ID
0.06
COMMON
0.06
ödül
0.06
GLint
0.06
Activations Density 0.030%