INDEX
Explanations
phrases related to specific cities
occurrences of the phrase "City of" followed by various city names
Negative Logits
Token         Logit
NetMessage    -0.81
wcs           -0.70
reddits       -0.69
recite        -0.69
requires      -0.68
<?            -0.68
ittees        -0.68
downs         -0.67
fights        -0.65
showc         -0.64
Positive Logits
Token             Logit
Light             0.88
Excellence        0.84
Sandwich          0.83
Record            0.82
Balance           0.81
Peace             0.80
York              0.79
Hate              0.79
Nations           0.78
Accountability    0.77
Activations Density: 0.090%