INDEX
Explanations
instances of the word "further" and its various forms, indicating a focus on progression or additional information
Negative Logits
Token      Logit
run        -0.16
uno        -0.16
rum        -0.16
undo       -0.16
eln        -0.15
ernen      -0.15
vak        -0.15
hiểm       -0.15
uri        -0.15
ified      -0.15
Positive Logits
Token      Logit
ance       0.30
ado        0.26
-reaching  0.24
most       0.23
hin        0.23
ing        0.23
than       0.22
-than      0.22
MORE       0.19
-more      0.19
Activations Density 0.023%
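As a rough illustration of how to read a density figure like this one, the sketch below treats activation density as the fraction of token positions where the feature's activation is above zero. That definition, the function name `activation_density`, the `threshold` parameter, and the sample data are all assumptions made for illustration. None of them come from the dashboard itself.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions where the feature fires.

    Assumption: a position counts as "active" when its activation is
    strictly greater than `threshold` (0.0 by default).
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 10,000 token positions, only 2 of them active.
sample = [0.0] * 9998 + [1.7, 0.4]
density = activation_density(sample)
print(f"{density:.3%}")  # prints "0.020%"
```

Under this reading, a density of 0.023% would mean the feature is active on roughly 23 out of every 100,000 token positions, which fits a feature tied to a single word family.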