INDEX
Explanations
mentions of physical locomotion
references to walking
New Auto-Interp
Negative Logits
encies
-0.91
illet
-0.73
Products
-0.70
Dimension
-0.70
CCC
-0.70
haps
-0.67
tains
-0.65
orate
-0.65
evin
-0.64
quota
-0.64
POSITIVE LOGITS
walking
3.30
walking
2.29
Walking
2.21
walk
2.14
walks
1.92
wandering
1.86
walked
1.73
hiking
1.70
biking
1.65
Walk
1.63
Activations Density 0.026%