Explanations
references to a specific television news network (Fox News)
mentions of the Fox News channel
Negative Logits
acterial  -0.77
ansom     -0.72
akeru     -0.72
rians     -0.71
inval     -0.69
rian      -0.69
orically  -0.67
abulary   -0.66
Scotia    -0.66
attled    -0.65
Positive Logits
conn    1.47
News    0.98
News    0.97
woods   0.84
hawk    0.83
Planet  0.82
Fox     0.81
FOX     0.80
Wire    0.79
fire    0.78
Activations Density: 0.016%