INDEX
Explanations
terms related to safety and competition in sports
New Auto-Interp
Negative Logits
iei
-0.18
orsch
-0.16
isphere
-0.14
esi
-0.14
aming
-0.14
mag
-0.14
Kraj
-0.13
landmark
-0.13
andes
-0.13
ź
-0.13
POSITIVE LOGITS
popularity
0.22
Played
0.16
practiced
0.16
popular
0.15
dưỡng
0.14
аÑĢод
0.14
elit
0.14
ariant
0.14
popular
0.14
passion
0.14
Activations Density 0.110%