INDEX
Explanations
terms related to democracy and political terminology
mentions of democratic processes and documentation
New Auto-Interp
Negative Logits
Dunham
-0.65
looting
-0.64
Jackson
-0.64
range
-0.63
Booker
-0.63
venture
-0.62
Solo
-0.61
visibility
-0.60
circulation
-0.60
Clair
-0.60
POSITIVE LOGITS
dem
3.17
common
1.92
doc
1.45
dot
1.24
demon
1.14
dash
1.09
dev
1.01
dos
0.99
des
0.98
tar
0.98
Activations Density 0.009%