INDEX
Explanations
mentions of the cities Pittsburgh and Montreal
Pittsburgh, Montreal, Seattle
Negative Logits
Logit   Token
-0.41   сыл
-0.38   bill
-0.37   issory
-0.36   دهنده
-0.36   lb
-0.36   sum
-0.36   ضات
-0.35   бил
-0.35   子
-0.34   term
Positive Logits
Logit   Token
0.87    Moscow
0.86    Tokyo
0.84    ADELPHIA
0.83    Chicago
0.82    Amsterdam
0.81    Moscou
0.81    Toronto
0.80    Atlanta
0.80    Beijing
0.80    Philadelphia
Activations Density: 0.045%