INDEX
Explanations
terms related to sports and physical activities, particularly actions and outcomes in games
New Auto-Interp
Negative Logits
Rogers
-0.15
vik
-0.15
ocoder
-0.15
wid
-0.14
antar
-0.14
ÑĸÑĩ
-0.14
_sensitive
-0.14
.robot
-0.14
activ
-0.14
trip
-0.14
POSITIVE LOGITS
ball
0.66
balls
0.54
ball
0.53
Ball
0.51
-ball
0.49
Ball
0.47
balls
0.46
BALL
0.45
çIJĥ
0.44
_ball
0.44
Activations Density 0.155%