INDEX
Explanations
references to historical political crises and power dynamics in governance
Negative Logits
Token     Logit
.Magic    -0.18
oose      -0.18
cub       -0.17
insky     -0.17
afka      -0.15
apolis    -0.15
cubic     -0.14
íĮĶ       -0.14
padd      -0.14
XT        -0.14
Positive Logits
Token        Logit
164          0.35
166          0.32
165          0.31
Crom         0.30
Parliament   0.27
Laud         0.25
Oliver       0.24
Pur          0.24
163          0.23
Restoration  0.23
Activations Density: 0.021%