INDEX
Explanations
references to sports events, especially marathons
Negative Logits
metics    -0.88
orum      -0.74
onom      -0.70
choes     -0.69
onomic    -0.69
obyl      -0.68
oshop     -0.67
Debor     -0.67
olia      -0.67
imated    -0.66
Positive Logits
runner    1.18
runners   1.04
Runner    0.95
athon     0.94
marathon  0.87
bombings  0.78
jog       0.77
Marathon  0.77
bombing   0.76
runner    0.74
Activations Density 0.024%