INDEX
Explanations
anti-liberal or anti-conservative ideologies
Negative Logits
weld        0.70
វេ          0.70
Overnight   0.69
ocket       0.69
Coaching    0.68
த்திரை      0.68
captcha     0.67
notte       0.67
LEG         0.67
SignIn      0.66
Positive Logits
conservative    1.79
conservatism    1.74
liberalism      1.66
conservatives   1.50
conservative    1.50
konserv         1.46
libertarian     1.39
Conservative    1.39
Conservative    1.32
feminism        1.28
Activations Density 0.159%