INDEX
Explanations
activities related to physical exertion or exercise
New Auto-Interp
Negative Logits
irect
-0.18
emey
-0.16
sthrough
-0.16
eman
-0.15
jez
-0.15
reator
-0.14
internal
-0.14
.Apis
-0.14
strup
-0.14
llib
-0.14
POSITIVE LOGITS
competit
0.26
recre
0.24
seriously
0.24
regularly
0.24
naked
0.19
hard
0.19
professionally
0.18
intens
0.18
bare
0.17
/browse
0.17
Activations Density 0.179%