INDEX
Explanations
references to significant sports achievements or events
New Auto-Interp
Negative Logits
volt
-0.17
FromClass
-0.15
_supp
-0.15
ãģ¡ãĤī
-0.15
omb
-0.15
borg
-0.15
.Framework
-0.15
lus
-0.14
indr
-0.14
CRYPT
-0.14
POSITIVE LOGITS
sacrifice
0.27
ground
0.26
grou
0.26
sac
0.25
sacrific
0.22
blo
0.22
liner
0.21
infield
0.21
single
0.21
Sacr
0.20
Activations Density 0.006%