INDEX
Explanations
references to receiving news updates, promotions, and newsletters from The New York Times
Negative Logits
abase      -0.66
ello       -0.61
onics      -0.61
uca        -0.60
aimon      -0.59
okane      -0.58
undown     -0.56
appell     -0.55
terday     -0.54
oshop      -0.53
Positive Logits
APD        0.65
});        0.62
earable    0.60
Article    0.60
govtrack   0.58
Terms      0.57
trave      0.56
isodes     0.53
Gutenberg  0.52
Caller     0.51
Activations Density 4.843%