Explanations
proper nouns related to sports events and associated teams or players
Negative Logits
symp               -0.82
weap               -0.78
condem             -0.76
cheek              -0.74
bead               -0.74
natureconservancy  -0.72
cuff               -0.72
corrid             -0.72
tatt               -0.70
rhy                -0.70
Positive Logits
1978  0.96
2018  0.88
2014  0.86
2017  0.85
1994  0.85
2019  0.85
1979  0.85
1998  0.84
2013  0.83
2019  0.83
Activations Density 0.018%