INDEX
Explanations
the word "and" at various frequencies in the text
Negative Logits
oko     -0.19
iko     -0.18
ivo     -0.16
itz     -0.15
oom     -0.15
ogue    -0.14
ük      -0.14
kh      -0.14
ohn     -0.14
omas    -0.14
Positive Logits
alten     0.16
ackbar    0.16
igest     0.15
Doug      0.15
igure     0.15
EMPLARY   0.14
acias     0.14
odzi      0.14
misd      0.14
atk       0.14
Activations Density 0.020%