INDEX
Explanations
verbs related to quick movement or action
instances of urgent or hurried actions
New Auto-Interp
Negative Logits
Ranked
-0.75
iciency
-0.69
Continued
-0.68
nton
-0.65
cius
-0.63
oln
-0.59
illusion
-0.59
minus
-0.58
ãĥĩãĤ£
-0.58
ancy
-0.58
POSITIVE LOGITS
onto
0.94
ashore
0.94
toward
0.92
forward
0.91
hither
0.90
blindly
0.90
overboard
0.87
into
0.86
towards
0.86
frantically
0.84
Activations Density 0.060%