INDEX
Explanations
mentions of winning or victory
instances of the word "won" and its variants, which indicate victories or success
Negative Logits
senal      -0.64
heter      -0.62
tan        -0.62
scanner    -0.61
opter      -0.60
sket       -0.60
tar        -0.59
trolls     -0.58
erous      -0.58
ria        -0.58
Positive Logits
't               0.94
nings            0.87
championships    0.85
streak           0.82
Rookie           0.82
outright         0.80
ced              0.79
streaks          0.79
MVP              0.76
throp            0.76
Activations Density 0.054%
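
A density figure like the one above is commonly read as the share of sampled tokens on which the feature fires at all. The sketch below shows that calculation under the assumption that "fires" means a strictly positive activation. The function name `activation_density` and the sample values are hypothetical and are not taken from this dashboard.

```python
from typing import Sequence


def activation_density(activations: Sequence[float], threshold: float = 0.0) -> float:
    """Return the fraction of tokens whose activation exceeds `threshold`.

    Assumption: density is the share of tokens with activation > threshold.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 10,000 tokens, 5 of which activate the feature.
sample = [0.0] * 9995 + [1.2, 0.4, 3.1, 0.9, 2.2]
density = activation_density(sample)
print(f"{density:.3%}")
```

A low density of this kind would be consistent with a sparse feature that responds only to a narrow set of contexts, such as the "won" and victory mentions described under Explanations.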