INDEX
Explanations
names of newspapers and companies
phrases that mention sources or authors in reporting
New Auto-Interp
Negative Logits
ripple
-0.57
sul
-0.55
anecd
-0.54
fallout
-0.54
shrug
-0.53
anyways
-0.53
sigh
-0.52
FontSize
-0.52
flush
-0.51
goose
-0.51
POSITIVE LOGITS
anium
0.69
AppData
0.68
iscover
0.63
agen
0.62
Publisher
0.60
és
0.60
vern
0.59
nex
0.59
psey
0.58
Appearances
0.58
Activations Density 0.441%