INDEX
Explanations
the word "can" in various contexts indicating ability or options
New Auto-Interp
Negative Logits
airo
-0.18
stvo
-0.15
ignon
-0.15
aurus
-0.14
etto
-0.14
itself
-0.14
ÙĨدÙĬ
-0.14
irror
-0.14
ngo
-0.13
undry
-0.13
POSITIVE LOGITS
freely
0.18
266
0.16
’t
0.15
elas
0.15
624
0.15
801
0.15
239
0.14
acre
0.14
cheng
0.14
LEE
0.14
Activations Density 0.077%