INDEX
Explanations
references to sports teams and locations related to hockey
New Auto-Interp
Negative Logits
orer
-0.16
okes
-0.16
ogenesis
-0.15
luent
-0.15
olar
-0.14
Ferm
-0.14
onaut
-0.14
Oregon
-0.14
imos
-0.14
Oregon
-0.14
POSITIVE LOGITS
NYC
0.26
NY
0.24
NY
0.23
.ny
0.16
vsp
0.16
Brooklyn
0.15
ãĥ¼ãĥģ
0.15
NYPD
0.15
/stdc
0.15
Rutgers
0.15
Activations Density 0.675%