INDEX
Explanations
mentions of the Fox News network
references to "Fox News."
New Auto-Interp
Negative Logits
inval
-0.64
quo
-0.64
orically
-0.62
adding
-0.61
rians
-0.61
ussed
-0.60
umbn
-0.60
captcha
-0.60
accompan
-0.59
iments
-0.59
POSITIVE LOGITS
conn
1.53
News
1.06
News
1.02
croft
0.95
woods
0.93
borough
0.92
Broadcasting
0.91
cat
0.88
Sports
0.87
xy
0.86
Activations Density 0.021%