INDEX
Explanations
countries and the specific entities that are situated in these countries
statements about the geographical significance or status of various countries
Negative Logits
xual          -0.75
owan          -0.74
sers          -0.70
ipers         -0.69
validity      -0.63
anches        -0.62
sed           -0.62
aroo          -0.62
ults          -0.61
curls         -0.61
Positive Logits
ranked        0.96
experiencing  0.94
exporting     0.93
witnessing    0.92
reeling       0.87
poised        0.85
importing     0.83
fortunate     0.82
booming       0.81
gripped       0.80
Activations Density 0.180%