INDEX
Explanations
mentions of the concept of democracy
references to democracy
Negative Logits
senal         -0.77
AMI           -0.74
abee          -0.68
ventory       -0.68
iday          -0.64
ifax          -0.63
ultraviolet   -0.63
ulla          -0.62
ueless        -0.62
entious       -0.62
Positive Logits
ocracy        0.83
democracy     0.83
enshr         0.82
ocrat         0.81
republic      0.79
democracies   0.78
democracy     0.76
safeguards    0.75
eering        0.75
reform        0.74
Activations Density 0.040%