INDEX
Explanations
references to sports-related activities and events
New Auto-Interp
Negative Logits
Ballet
-0.17
swim
-0.16
dk
-0.16
uto
-0.15
otes
-0.15
olum
-0.15
swimming
-0.15
èī¯
-0.15
ÃŃch
-0.14
Soccer
-0.14
POSITIVE LOGITS
curl
0.35
Curl
0.29
curl
0.28
curled
0.24
CURL
0.22
singles
0.21
stones
0.21
skipped
0.20
curls
0.20
skips
0.20
Activations Density 0.005%