INDEX
Explanations
terms related to oligarchy and power dynamics
references to oligarchy
New Auto-Interp
Negative Logits
20439
-0.90
à¨
-0.76
birth
-0.73
CARE
-0.73
olver
-0.70
Courage
-0.70
Strong
-0.69
Flying
-0.69
leigh
-0.67
Dill
-0.67
POSITIVE LOGITS
archs
1.31
archy
1.20
opol
1.13
opoly
1.04
olig
1.03
ration
1.03
orius
1.00
arch
0.91
eties
0.89
ovy
0.89
Activations Density 0.038%