INDEX
Explanations
references to specific publications like "The New York Times" and "Los Angeles Times" in different contexts
Negative Logits
sed         -0.83
cles        -0.74
gone        -0.67
unts        -0.66
venge       -0.64
ardless     -0.63
bands       -0.63
shift       -0.63
isable      -0.63
together    -0.62
Positive Logits
Editorial    1.27
editorial    1.23
Newsp        1.20
Magazine     1.18
columnist    1.15
reporter     1.10
Newspaper    1.09
Editors      1.09
reporters    1.06
editors      1.05
Activations Density 1.560%