Explanations
sports-related terms, including scores, wins, losses, and player statistics
Negative Logits
agna            -0.76
icus            -0.64
don             -0.63
eri             -0.60
pain            -0.59
appropriately   -0.58
use             -0.58
advert          -0.58
study           -0.58
ctrl            -0.57
Positive Logits
½               0.76
ĪĴ              0.75
underdog        0.74
aggregate       0.71
overall         0.71
overtime        0.70
halftime        0.70
FG              0.68
TING            0.67
-+-+            0.66
Activations Density 0.530%