INDEX
Negative Logits
uel
-0.19
ovich
-0.17
edom
-0.16
acl
-0.16
Gaul
-0.15
erts
-0.15
eda
-0.15
ologie
-0.14
yo
-0.14
layer
-0.14
POSITIVE LOGITS
Kemp
0.18
اÙĨÙĪ
0.16
ancor
0.15
_cats
0.15
impro
0.15
ewire
0.15
enor
0.15
¢
0.15
é±
0.15
copp
0.14
Activations Density 0.013%