INDEX
Explanations
references to New York or its neighborhoods
New Auto-Interp
Negative Logits
Stick
-0.16
ainer
-0.15
editary
-0.15
drum
-0.15
blindly
-0.15
èģ¯
-0.15
ë¦Ħ
-0.14
Coff
-0.14
Manitoba
-0.14
.bz
-0.14
POSITIVE LOGITS
York
0.32
Haven
0.21
York
0.21
ìļķ
0.20
YORK
0.20
york
0.20
Jersey
0.19
-Y
0.19
NY
0.18
Roch
0.18
Activations Density 0.023%