INDEX
Negative Logits
uada
-0.17
ÄĻk
-0.15
Ñľ
-0.15
ÑĥÑĪки
-0.15
unga
-0.15
chner
-0.15
vac
-0.14
jak
-0.14
iani
-0.14
awai
-0.14
POSITIVE LOGITS
Om
0.16
cura
0.16
interoper
0.15
sm
0.15
aucoup
0.15
urg
0.14
ali
0.14
arker
0.14
itzer
0.14
disp
0.14
Activations Density 0.014%