INDEX
Explanations
references to walking and outdoor activities
New Auto-Interp
Negative Logits
ajs
-0.16
eel
-0.15
ÑĢÑĮ
-0.15
ldb
-0.15
UNCTION
-0.14
unker
-0.14
ebe
-0.14
boarding
-0.13
unction
-0.13
ween
-0.13
POSITIVE LOGITS
walks
0.54
walk
0.49
walk
0.45
Walk
0.43
Walk
0.42
walked
0.41
_walk
0.40
walking
0.39
.walk
0.36
walker
0.35
Activations Density 0.116%