INDEX
Explanations
terms related to societal structures and power dynamics, particularly focusing on communities, establishments, and elites
references to various communities and their relationships with societal structures or power dynamics
Negative Logits
Ô          -0.79
ishable    -0.71
opl        -0.70
Plate      -0.70
ocol       -0.66
itial      -0.64
planet     -0.62
aring      -0.62
Kard       -0.61
ĨĴ         -0.60
Positive Logits
itself     0.80
members    0.79
ariat      0.76
hierarchy  0.76
members    0.74
alike      0.72
reacted    0.71
interven   0.69
leaders    0.69
assembled  0.68
Activations Density 0.201%