INDEX
Explanations
words related to power and control, especially focusing on political and economic elites
references to social and political elites
New Auto-Interp
Negative Logits
ãĥ¼ãĥ³
-0.81
Archangel
-0.73
SourceFile
-0.70
Dull
-0.68
Ship
-0.67
agh
-0.66
upon
-0.66
Deadly
-0.66
Õ
-0.66
atur
-0.66
POSITIVE LOGITS
elites
1.19
ervatives
0.92
ablishment
0.91
rats
0.88
incent
0.88
millenn
0.82
ervative
0.81
princ
0.81
usional
0.81
elite
0.80
Activations Density 0.015%