INDEX
Negative Logits
/D
-0.07
Aber
-0.06
Netherlands
-0.06
财
-0.06
decoding
-0.06
mulher
-0.06
_small
-0.06
BUFF
-0.06
regression
-0.06
خور
-0.06
POSITIVE LOGITS
lates
0.06
uddled
0.06
fossils
0.06
Nude
0.06
سف
0.06
�
0.06
qualification
0.06
#+
0.06
“
0.06
ildiği
0.05
Activations Density 0.013%