INDEX
Explanations
phrases related to victories or winning in sports
terms related to winning in competitive contexts
New Auto-Interp
Negative Logits
enegger
-0.76
Lak
-0.60
Fires
-0.60
Dupl
-0.59
opian
-0.58
encyclopedia
-0.56
Wrong
-0.55
Suicide
-0.55
heirs
-0.55
selves
-0.54
POSITIVE LOGITS
nings
1.34
streak
1.21
stre
1.19
throp
1.11
less
1.03
streaks
0.97
now
0.97
ery
0.94
iem
0.92
expectancy
0.90
Activations Density 0.037%