INDEX
Negative Logits
eton
-0.17
igans
-0.16
usher
-0.15
rait
-0.14
ÑĢаÑħ
-0.14
sice
-0.14
Ù¾ÛĮØ´
-0.13
reap
-0.13
RM
-0.13
def
-0.13
POSITIVE LOGITS
Hind
0.14
562
0.14
_lang
0.14
holes
0.14
lang
0.14
оÑĢи
0.14
ĩ¼
0.14
iani
0.14
astro
0.14
лиз
0.13
Activations Density 0.005%