INDEX
Explanations
instances of the verb "run" in various forms
Negative Logits
mente     -0.19
ambre     -0.17
ence      -0.16
/qu       -0.16
unst      -0.15
phalt     -0.15
939       -0.15
/from     -0.15
ories     -0.15
ptal      -0.15
Positive Logits
escape    0.19
abouts    0.17
igram     0.16
mage      0.15
-down     0.15
avigator  0.15
DDS       0.15
ispers    0.14
DDL       0.14
lin       0.14
Activations Density: 0.099%