INDEX
Explanations
the word "Do" as an imperative or question prompt
New Auto-Interp
Negative Logits
uyo
-0.19
ipi
-0.17
usto
-0.16
.tv
-0.15
nnen
-0.15
anco
-0.14
ungan
-0.14
uy
-0.14
'gc
-0.14
pi
-0.13
POSITIVE LOGITS
zens
0.20
antes
0.17
anter
0.16
seg
0.15
SENT
0.15
orm
0.14
olie
0.14
ÑĥÑĩа
0.14
zen
0.14
tang
0.14
Activations Density 0.029%