Explanations
words related to competitive races or contests
references to the concept of "race."
Negative Logits
vou         -0.69
Vera        -0.69
ç«          -0.68
vor         -0.64
answered    -0.64
hern        -0.64
fired       -0.62
Prov        -0.62
Lay         -0.62
stairs      -0.61
Positive Logits
race        3.93
Race        2.99
races       2.80
Race        2.78
race        2.64
Races       2.17
racing      1.70
racial      1.60
racer       1.57
raced       1.54
Activations Density 0.017%