INDEX
Explanations
words related to competitions or contests
occurrences of the word "race."
New Auto-Interp
Negative Logits
alty
-0.69
urers
-0.69
oca
-0.67
arial
-0.67
azaki
-0.65
olia
-0.65
etheless
-0.63
ricular
-0.63
azy
-0.63
orescent
-0.62
POSITIVE LOGITS
course
1.44
horse
1.34
cars
1.09
car
1.04
bike
0.89
mma
0.84
ways
0.81
manship
0.80
runner
0.78
nell
0.78
Activations Density 0.025%