Explanations
references to a specific news organization, specifically "Guardian."
mentions of "The Guardian" media outlet
Negative Logits
Token         Logit
perm          -0.87
cles          -0.84
itionally     -0.84
lease         -0.82
anners        -0.76
inant         -0.75
ramid         -0.74
jri           -0.74
merce         -0.74
NetMessage    -0.72
Positive Logits
Token         Logit
Angels        1.00
Editorial     0.92
Angel         0.85
Observer      0.82
Newsp         0.79
Newspaper     0.79
Leaks         0.78
columnist     0.78
Agency        0.77
Correspond    0.77
Activations Density: 0.022%