INDEX
Explanations
references to walking and walks
references to walking activities
New Auto-Interp
Negative Logits
cffff
-0.80
uchin
-0.71
ccording
-0.71
illian
-0.71
uliffe
-0.71
milo
-0.71
ultz
-0.69
ARA
-0.65
ancies
-0.64
berra
-0.64
POSITIVE LOGITS
Walk
1.01
walk
0.95
Walk
0.94
walk
0.91
ways
0.89
walker
0.87
athon
0.87
own
0.86
lihood
0.84
walking
0.83
Activations Density 0.017%