Explanations
sentences with varying punctuation marks
Negative Logits
Token        Logit
Selectors    -0.15
bana         -0.15
ilter        -0.15
emic         -0.14
iga          -0.14
Ïĥαν         -0.13
bs           -0.13
ÑĩÑĥк        -0.13
zon          -0.13
iven         -0.13
Positive Logits
Token        Logit
onaut        0.16
siz          0.15
½Ķ           0.15
ź            0.15
mrt          0.14
urette       0.14
mue          0.14
reeze        0.14
ÏĦεÏģ        0.14
sink         0.14
Activation density: 0.004%