INDEX
NEGATIVE LOGITS
Token          Logit
udel           -0.09
possibly       -0.08
silently       -0.08
randomly       -0.08
sammen         -0.08
hete           -0.07
ussions        -0.07
Consequently   -0.07
Nazar          -0.07
inti           -0.07
POSITIVE LOGITS
Token          Logit
urgent         0.08
Urg            0.08
carving        0.08
aussehen       0.08
pupọ           0.08
carved         0.07
велик          0.07
exper          0.07
brass          0.07
carve          0.07
Activations Density: 0.001%