INDEX
Explanations
mentions of investigating or looking into dishonesty and corruption
questions and mentions of specific social or political issues
New Auto-Interp
Negative Logits
xual
-0.75
pta
-0.72
mounted
-0.67
urch
-0.64
ents
-0.63
disabled
-0.63
normally
-0.61
fur
-0.61
una
-0.61
otally
-0.60
POSITIVE LOGITS
Detailed
0.85
VIDEOS
0.76
Politics
0.73
Nap
0.73
Advertisements
0.72
Latest
0.72
NPR
0.71
Nov
0.71
Quotes
0.69
Discover
0.67
Activations Density 0.210%