INDEX
Explanations
references to subway stations
mentions of subway systems
New Auto-Interp
Negative Logits
wine
-0.74
iyah
-0.72
cius
-0.71
ificial
-0.71
erenn
-0.69
abama
-0.69
ivia
-0.68
laus
-0.67
Christensen
-0.66
arget
-0.66
POSITIVE LOGITS
subway
1.08
Subway
0.98
commute
0.89
station
0.84
commuting
0.83
stations
0.81
trains
0.81
Transit
0.79
roads
0.78
entrances
0.78
Activations Density 0.019%