INDEX
Explanations
phrases related to dishonesty and deception
references to dishonesty and deception in political discourse
New Auto-Interp
Negative Logits
agos
-0.95
lear
-0.85
AMI
-0.79
Pets
-0.78
illin
-0.76
rieve
-0.76
plates
-0.74
ipeg
-0.72
uve
-0.72
Waves
-0.72
POSITIVE LOGITS
misrepresent
1.65
deceit
1.57
misinformation
1.56
disinformation
1.53
incompetence
1.49
deception
1.49
misleading
1.47
fraud
1.45
falsehood
1.45
ignorance
1.44
Activations Density 0.364%