Negative Logits
Token   Logit
agens   -0.17
icle    -0.16
ple     -0.15
anon    -0.14
[unit   -0.14
ker     -0.14
icol    -0.14
ixin    -0.14
horn    -0.14
kos     -0.14
Positive Logits
Token    Logit
ouns     0.18
nem      0.17
crest    0.17
ville    0.16
rim      0.15
Ñħо      0.15
oun      0.15
illac    0.15
ymi      0.15
oreach   0.14
Activations Density: 0.049%