INDEX
Explanations
verbs related to movement on foot
occurrences of the word "walk" and its variations
Negative Logits
Token        Logit
Saud         -0.65
iller        -0.64
encies       -0.64
orious       -0.63
iler         -0.62
rent         -0.60
nant         -0.60
mal          -0.60
etric        -0.59
iling        -0.58
Positive Logits
Token        Logit
away         1.08
through      1.01
upright      0.92
confidently  0.88
uphill       0.86
brisk        0.86
into         0.85
ginger       0.84
about        0.83
awa          0.83
Activations Density 0.048%
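
The density figure above can be read as the share of tokens on which the feature fires at all. The sketch below assumes the usual definition (tokens with activation above a threshold, divided by all tokens sampled). The function name `activation_density`, the zero threshold, and the sample data are hypothetical, chosen only to reproduce a value of the same size. They are not taken from the dashboard itself.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of tokens whose activation exceeds `threshold`.

    Assumption: a token counts as "active" when its activation is strictly
    greater than `threshold`; the dashboard's exact rule is not stated.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 48 active tokens out of 100,000 sampled tokens.
sample = [1.0] * 48 + [0.0] * 99_952
density = activation_density(sample)
print(f"{density:.3%}")
```

Under this reading, a density of 0.048% corresponds to roughly 48 firing tokens per 100,000, which fits a sparse feature tied to a narrow concept such as walking.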