INDEX
Explanations
mentions of geographic regions, specifically Eastern European countries
New Auto-Interp
Negative Logits
bats
-0.87
renheit
-0.81
GUI
-0.80
VERTISEMENT
-0.77
pload
-0.77
asonable
-0.76
wik
-0.76
pass
-0.75
mouth
-0.73
mercial
-0.73
POSITIVE LOGITS
countries
1.33
nations
1.25
Countries
1.13
dictators
1.03
Languages
0.99
region
0.97
istan
0.95
regions
0.95
kingdoms
0.95
languages
0.94
Activations Density 0.072%