INDEX
Explanations
references to victories in sports
words related to winning in sports contexts
Negative Logits
enegger      -0.69
cens         -0.67
Dupl         -0.65
Wiki         -0.62
forks        -0.62
intestine    -0.62
Fires        -0.62
notor        -0.61
labour       -0.61
Versions     -0.60
Positive Logits
nings        1.27
streak       1.14
liness       1.01
throp        0.96
iem          0.93
stre         0.90
streaks      0.89
less         0.88
now          0.83
expectancy   0.82
Activations Density: 0.050%