INDEX
Explanations
locations or areas within a city, particularly focusing on downtown areas
references to downtown areas
Negative Logits
ktop        -0.89
uid         -0.85
nir         -0.79
nikov       -0.77
alys        -0.77
aye         -0.76
uth         -0.75
laughter    -0.75
nell        -0.71
ula         -0.71
Positive Logits
Oakland      1.01
Denver       0.99
Manhattan    0.98
Seattle      0.96
Honolulu     0.96
Toronto      0.95
Raleigh      0.94
Vancouver    0.94
Los          0.93
Winnipeg     0.92
Activations Density 0.027%
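
The density figure above is usually read as the share of token positions on which the feature fires at all. The sketch below shows one way such a number could be computed. It assumes that "fires" means an activation strictly above a threshold of 0.0; the function name, the threshold, and the sample data are all hypothetical and are not taken from this page or from any particular tool.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds threshold.

    Assumption: a position counts as active when its activation is strictly
    greater than `threshold` (0.0 by default). This is an illustrative
    definition, not necessarily the one used to produce the dashboard value.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: 10,000 token positions, 3 of which are active.
sample = [0.0] * 9997 + [1.2, 0.4, 2.5]
density = activation_density(sample)
print(f"{density:.3%}")
```

A density as low as 0.027% would mean the feature is active on roughly 27 of every 100,000 token positions, which fits a feature tied to a narrow topic such as downtown areas and city names.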