INDEX
Explanations
mentions of the concept of democracy
references to democracy
Negative Logits
senal      -0.85
AMI        -0.73
notes      -0.73
hews       -0.72
FU         -0.70
thy        -0.69
abee       -0.69
ventory    -0.68
iple       -0.67
Contact    -0.67
Positive Logits
ocracy     0.84
enshr      0.80
ocrat      0.79
eering     0.78
laureate   0.76
republic   0.71
democracy  0.71
watchdog   0.70
democracy  0.70
embodied   0.70
Activations Density 0.034%