INDEX
Explanations
references to "The New York Times"
mentions of "The New York Times."
New Auto-Interp
Negative Logits
sed
-0.80
rd
-0.76
ded
-0.75
razil
-0.74
lain
-0.73
halla
-0.71
cius
-0.70
ressive
-0.70
itive
-0.68
ardless
-0.67
POSITIVE LOGITS
Magazine
0.90
Carbuncle
0.81
bestselling
0.80
Literary
0.79
Corpus
0.75
Square
0.74
Editorial
0.74
Newsp
0.72
Newsletter
0.71
Dispatch
0.70
Activations Density 0.015%