INDEX
Explanations
specific locations or places
proper nouns related to cities and locations
New Auto-Interp
Negative Logits
leigh
-0.58
bread
-0.58
Lomb
-0.57
Burgess
-0.57
Kash
-0.56
rano
-0.56
Staten
-0.55
bottom
-0.55
Greenberg
-0.54
ogether
-0.54
POSITIVE LOGITS
ION
1.18
HI
1.06
IONS
0.99
ONDON
0.95
ANCE
0.94
ING
0.94
ENN
0.93
LIN
0.93
TON
0.92
IST
0.91
Activations Density 0.034%