INDEX
Explanations
the action of walking
instances of the word "walked" in various contexts
Negative Logits
ional        -0.88
etric        -0.77
ency         -0.76
iled         -0.76
ãĥª          -0.75
usable       -0.74
relevant     -0.74
emin         -0.73
encies       -0.73
ccording     -0.71
Positive Logits
Walk         0.90
walk         0.88
walks        0.84
ashore       0.82
stroll       0.81
stride       0.81
walked       0.79
walk         0.78
Walk         0.77
escription   0.77
Activations Density: 0.010%