Negative Logits
Token         Logit
ONE           -0.09
_LOC          -0.09
.FL           -0.08
Ế             -0.08
Apparently    -0.08
_VOID         -0.08
aurus         -0.08
Rauch         -0.08
_STEP         -0.08
agre          -0.07
Positive Logits
Token         Logit
ago           0.09
crunch        0.08
sex           0.07
Под           0.07
diet          0.07
digestive     0.07
processed     0.07
tertentu      0.07
دې            0.07
zusammen      0.07
Activations Density: 0.003%