INDEX
Negative Logits
nonnull
-0.17
bast
-0.16
eh
-0.15
legalization
-0.15
dra
-0.14
tür
-0.14
_globals
-0.14
ÑĩиÑħ
-0.14
reesome
-0.14
dish
-0.14
POSITIVE LOGITS
Sesso
0.17
yers
0.15
enda
0.15
yer
0.15
raid
0.15
umba
0.14
ër
0.14
erties
0.14
heck
0.14
jez
0.14
Activations Density 0.004%