INDEX
Explanations
mentions of specific city names, particularly focusing on capital cities
mentions of various capitals
Negative Logits
potion      -0.80
AUT         -0.76
sbm         -0.76
akers       -0.73
wd          -0.70
eker        -0.70
ĪĴ          -0.69
Choice      -0.68
hner        -0.67
MpServer    -0.67
Positive Logits
metropolitan    0.91
city            0.86
suburb          0.81
cities          0.80
itals           0.76
uania           0.76
metro           0.74
Manila          0.71
ashtra          0.70
omach           0.70
Activations Density 0.017%