Explanations
actions and movements related to physical tasks or responsibilities
Negative Logits
favorable      -0.20
unfavorable    -0.19
catalogs       -0.18
zipper         -0.17
enrollment     -0.17
Enrollment     -0.17
upward         -0.16
neighborhoods  -0.16
uki            -0.16
oriented       -0.16
Positive Logits
round     0.38
round     0.23
forwards  0.22
aged      0.22
sens      0.22
fraction  0.20
Round     0.20
ROUND     0.19
.round    0.19
whilst    0.19
Activations Density: 0.308%