INDEX
Explanations
proper nouns related to news organizations
references to news publications, particularly "The Washington Post"
Negative Logits
sbm         -0.83
sed         -0.76
gone        -0.74
stood       -0.68
directions  -0.63
wana        -0.62
hra         -0.62
milo        -0.61
pex         -0.61
plot        -0.60
Positive Logits
Staff       0.78
Wire        0.78
pei         0.71
Buy         0.70
·           0.69
Toledo      0.68
ebus        0.68
toggle      0.65
ipeg        0.65
Associated  0.65
Activations Density: 0.061%