INDEX
Explanations
mentions of Western countries
references to "Western" entities or concepts
New Auto-Interp
Negative Logits
staking
-0.87
Downloadha
-0.81
chery
-0.79
Æ
-0.78
hod
-0.73
govtrack
-0.72
rss
-0.72
displayText
-0.71
ucha
-0.71
hid
-0.70
POSITIVE LOGITS
Hemisphere
1.18
ization
0.96
hemisphere
0.92
Isles
0.90
Civilization
0.90
ized
0.89
izing
0.88
Sahara
0.87
civilization
0.87
Union
0.87
Activations Density 0.029%