INDEX
Explanations
references to promotions, offers, and newsletters from The New York Times
New Auto-Interp
Negative Logits
abase
-0.67
groom
-0.66
rador
-0.60
undown
-0.60
moniker
-0.58
ulz
-0.57
mble
-0.57
persuasion
-0.55
befriend
-0.55
lehem
-0.55
POSITIVE LOGITS
autions
0.72
isodes
0.71
VIDEOS
0.70
govtrack
0.66
endif
0.66
Terms
0.64
olesterol
0.63
uries
0.60
isms
0.60
newsletters
0.59
Activations Density 0.015%