Explanations
mentions of the city "London"
references to London
Negative Logits
onies     -0.91
icio      -0.73
ONY       -0.71
Ĥª        -0.68
utical    -0.68
ocious    -0.68
hemy      -0.67
isode     -0.67
++++      -0.67
ajor      -0.67
Positive Logits
borough        1.11
Borough        1.09
Underground    1.07
Heath          0.99
Bridge         0.96
Whale          0.95
Calling        0.92
shire          0.90
Metropolitan   0.89
Mayor          0.89
Activations Density: 0.029%