INDEX
Explanations
words related to political moderation
references to the term "moderate" in various contexts
New Auto-Interp
Negative Logits
arium
-0.85
raltar
-0.77
æ©
-0.72
yang
-0.71
TYPE
-0.71
ograp
-0.71
asper
-0.70
ADRA
-0.70
chy
-0.70
STON
-0.69
POSITIVE LOGITS
erate
0.96
sized
0.88
minded
0.84
medi
0.77
fare
0.77
leaning
0.76
lees
0.71
easing
0.69
(<
0.68
xual
0.68
Activations Density 0.025%