INDEX
Explanations
references to democratic concepts or values
references to democratic concepts and values
New Auto-Interp
Negative Logits
olog
-0.77
Painter
-0.75
thur
-0.75
oles
-0.72
imus
-0.72
igi
-0.70
cial
-0.70
ting
-0.69
INESS
-0.69
other
-0.68
POSITIVE LOGITS
democratic
1.12
socialist
1.11
republic
1.10
republican
1.00
democracy
0.94
governance
0.88
freedoms
0.87
citiz
0.86
minded
0.85
rights
0.85
Activations Density 0.010%