INDEX
Explanations
references to the New York Times
mentions of "The New York Times."
New Auto-Interp
Negative Logits
reating
-0.80
ded
-0.74
rals
-0.74
resent
-0.73
utch
-0.73
icides
-0.72
iov
-0.72
ressive
-0.71
rown
-0.70
leased
-0.70
POSITIVE LOGITS
Square
0.85
bestselling
0.85
Magazine
0.84
Dispatch
0.75
Literary
0.75
Reader
0.74
Publishing
0.73
Observer
0.72
0.71
conclud
0.71
Activations Density 0.024%