INDEX
Explanations
actions or events related to sports achievements and competitions
New Auto-Interp
Negative Logits
etak
-0.17
981
-0.15
okrat
-0.15
mlin
-0.14
achs
-0.14
welt
-0.14
aks
-0.14
atsu
-0.14
ForResult
-0.13
quil
-0.13
POSITIVE LOGITS
victory
0.20
Victory
0.17
passage
0.16
control
0.16
spots
0.16
brag
0.15
resco
0.15
honors
0.15
DAMAGES
0.15
èĥľ
0.14
Activations Density 0.091%