INDEX
Explanations
references to locations and geographical features
New Auto-Interp
Negative Logits
Southampton
-0.20
Tottenham
-0.18
UNCT
-0.17
Bronx
-0.17
Yorkshire
-0.16
NH
-0.16
Liverpool
-0.15
Glouce
-0.15
Buckingham
-0.15
Durham
-0.15
POSITIVE LOGITS
Idaho
0.71
Boise
0.65
Nez
0.40
Spokane
0.40
Id
0.39
Ada
0.39
ID
0.35
IDA
0.34
(Id
0.34
,ID
0.33
Activations Density 0.012%