INDEX
Negative Logits
putih
-0.08
visually
-0.07
insensitive
-0.07
ಿವ
-0.07
deemed
-0.07
sembr
-0.07
sembra
-0.07
adm
-0.07
hlas
-0.07
kronor
-0.07
POSITIVE LOGITS
RAS
0.08
_CUSTOM
0.08
hingegen
0.08
Bax
0.08
మాత్రం
0.08
?
0.08
varten
0.08
Mast
0.08
sempre
0.08
역시
0.07
Activations Density 0.012%