INDEX
Explanations
mentions or references to the Stanley Cup
references to the Stanley Cup
New Auto-Interp
Negative Logits
ged
-0.93
gil
-0.82
hy
-0.81
anwhile
-0.79
ansas
-0.78
nir
-0.78
gel
-0.77
du
-0.76
lez
-0.76
htar
-0.75
POSITIVE LOGITS
Kubrick
1.07
Stanley
0.90
Winter
0.80
Baldwin
0.80
thal
0.80
Transactions
0.78
Bros
0.70
Clarke
0.70
Robinson
0.70
Games
0.69
Activations Density 0.014%