INDEX
Explanations
references to Fox News and related terms
references to Fox News
New Auto-Interp
Negative Logits
othing
-0.70
acterial
-0.68
ysis
-0.67
isu
-0.67
utic
-0.66
chest
-0.65
Ney
-0.65
expressive
-0.65
val
-0.64
onement
-0.64
POSITIVE LOGITS
Channel
1.25
Radio
0.95
Radio
0.93
Channel
0.92
Magazine
0.89
Talk
0.88
pund
0.86
Networks
0.85
News
0.84
commentator
0.82
Activations Density 0.033%