INDEX
Explanations
names of sports teams and players
New Auto-Interp
Negative Logits
stellar
-0.73
ana
-0.66
manned
-0.64
phased
-0.63
refrain
-0.59
boycot
-0.59
imaginable
-0.59
chau
-0.58
milit
-0.58
complementary
-0.58
POSITIVE LOGITS
airs
1.00
icker
0.98
akers
0.96
icks
0.95
ogging
0.94
ills
0.93
rows
0.92
ogs
0.92
angers
0.92
rosis
0.91
Activations Density 0.088%