INDEX
Explanations
references to urban development and infrastructure
Negative Logits
Token        Logit
Akron        -0.22
Ohio         -0.19
Ohio         -0.18
Clemson      -0.17
Niagara      -0.16
Syracuse     -0.16
Pittsburgh   -0.15
Nebraska     -0.15
Minnesota    -0.15
Asheville    -0.15
Positive Logits
Token        Logit
London       0.61
London       0.55
london       0.54
Tube         0.43
tube         0.41
ondon        0.41
Tf           0.40
Tube         0.38
Lond         0.38
tube         0.36
Activations Density: 0.253%