INDEX
Explanations
mentions of losing or loss in a competitive context
terms associated with defeat or negative outcomes in sports contexts
New Auto-Interp
Negative Logits
tan
-0.85
enegger
-0.80
ridges
-0.77
azon
-0.73
oola
-0.73
onomic
-0.70
mun
-0.68
lon
-0.67
height
-0.66
ature
-0.66
POSITIVE LOGITS
streak
0.85
foes
0.81
streaks
0.81
miser
0.75
esses
0.72
aversion
0.69
opponents
0.67
horribly
0.67
boss
0.67
bite
0.66
Activations Density 0.039%