INDEX
Explanations
words related to political and social systems, specifically focusing on terms related to mediocrity and democracy
terms related to various forms of government and political themes
New Auto-Interp
Negative Logits
ting
-0.86
ster
-0.78
ing
-0.75
listed
-0.71
ez
-0.70
staking
-0.67
ful
-0.65
bang
-0.63
fully
-0.62
ning
-0.62
POSITIVE LOGITS
acies
0.97
atism
0.90
kefeller
0.87
atically
0.84
ocracy
0.83
ocratic
0.79
phia
0.77
ocr
0.76
atics
0.76
asms
0.75
Activations Density 0.084%