INDEX
Explanations
terms related to conspiracy theories and misinformation
terms associated with conspiracy theories and misinformation, particularly around climate change and fake news
New Auto-Interp
Negative Logits
ktop
-0.78
eatures
-0.75
iosyn
-0.74
ugal
-0.73
illes
-0.70
mediately
-0.70
blance
-0.69
served
-0.65
Duty
-0.65
discharged
-0.64
POSITIVE LOGITS
disinformation
1.14
debunk
1.10
debunked
1.10
misinformation
1.03
icist
1.02
propaganda
0.91
hoax
0.89
falsehood
0.87
denial
0.87
ervative
0.87
Activations Density 0.332%