Explanations
instances of negative political language
Negative Logits
disemb            -0.83
hatch             -0.81
helper            -0.74
inactive          -0.73
defe              -0.73
[no token text]   -0.71
carbohyd          -0.71
transporter       -0.71
purse             -0.70
undet             -0.68
Positive Logits
Moreover          1.39
Indeed            1.39
Worse             1.38
Meanwhile         1.32
Whereas           1.31
Likewise          1.30
Yet               1.30
Such              1.29
Ultimately        1.29
Nonetheless       1.28
Activations Density: 1.743%