INDEX
Explanations
references to physical activity and exercise
Negative Logits
lak      -0.16
DJ       -0.15
ofilm    -0.15
wash     -0.14
DJ       -0.14
chod     -0.14
doch     -0.14
onden    -0.14
ise      -0.14
Dj       -0.14
Positive Logits
exercise   0.19
/stretch   0.18
nas        0.18
fitness    0.18
gym        0.18
_THROW     0.16
exercise   0.16
Exercise   0.16
exerc      0.15
Fitness    0.15
Activations Density: 0.128%