INDEX
Explanations
mentions of news organizations and journalists
references to news organizations and their reporting
New Auto-Interp
Negative Logits
lain
-0.71
bragging
-0.68
comed
-0.68
¯
-0.64
blogspot
-0.63
tumblr
-0.62
ãĥĹ
-0.61
fasc
-0.58
perenn
-0.57
Confederate
-0.57
POSITIVE LOGITS
correspondent
0.78
Investigative
0.73
obtained
0.68
investigation
0.67
ENE
0.65
yssey
0.65
understands
0.64
Asia
0.64
Probe
0.64
compr
0.63
Activations Density 0.123%