INDEX
Explanations
references to sports scores and leads
phrases related to numerical advantages or scoring in competitive contexts
Negative Logits
arus       -0.69
romy       -0.67
acron      -0.63
concess    -0.61
menus      -0.60
aida       -0.59
mbuds      -0.58
ammad      -0.58
ibur       -0.57
unemploy   -0.57
Positive Logits
breaker    0.89
heading    0.84
over       0.80
OVER       0.78
over       0.76
break      0.76
breaking   0.75
stre       0.73
holding    0.72
noon       0.71
Activations Density 0.080%