INDEX
Explanations
references to sports, with a focus on teams losing
instances of the word "lost" related to game outcomes
New Auto-Interp
Negative Logits
ahu
-0.74
hire
-0.67
Companies
-0.65
kson
-0.64
ographies
-0.62
POL
-0.60
SPONSORED
-0.59
rods
-0.59
IPS
-0.58
NVIDIA
-0.57
POSITIVE LOGITS
miser
1.11
convinc
1.02
decisively
0.90
bitterly
0.79
horribly
0.79
to
0.78
badly
0.78
narrowly
0.74
heartbreaking
0.72
midway
0.69
Activations Density 0.096%