INDEX
Negative Logits
ungi
-0.18
viso
-0.15
itters
-0.14
notated
-0.14
.selenium
-0.14
vincia
-0.14
resse
-0.13
ÏĮμÏīÏĤ
-0.13
umn
-0.13
)↵↵↵↵↵↵↵↵
-0.13
POSITIVE LOGITS
oret
0.19
vain
0.15
MS
0.15
sans
0.14
accent
0.14
odor
0.13
Rena
0.13
Saf
0.13
aud
0.13
near
0.13
Activations Density 0.100%