INDEX
Explanations
references to geographical locations, particularly regions and countries
references to Latin American and Asian regions or cultures
Negative Logits
bats                  -0.87
mercial               -0.84
pload                 -0.82
wik                   -0.80
PASS                  -0.79
isSpecialOrderable    -0.78
FOX                   -0.77
cloud                 -0.75
VERTISEMENT           -0.74
soType                -0.74
Positive Logits
dictators             1.00
istan                 1.00
countries             0.99
nations               0.93
regimes               0.88
Economic              0.81
civilisation          0.81
peoples               0.80
Emir                  0.79
descent               0.79
Activations Density: 0.049%