INDEX
Explanations
mentions of specific news outlets
references to specific newspapers or publications
New Auto-Interp
Negative Logits
chwitz
-0.77
awaru
-0.74
sed
-0.70
itars
-0.67
cles
-0.67
retard
-0.65
rely
-0.65
perm
-0.63
together
-0.63
ologically
-0.62
POSITIVE LOGITS
Tribune
1.25
Editorial
1.23
Newspaper
1.19
newspaper
1.12
editorial
1.06
Herald
1.01
Newsp
0.97
Sentinel
0.97
Gazette
0.94
Chronicle
0.94
Activations Density 0.087%