INDEX
Explanations
references to news sources or media outlets (e.g., The Daily Caller News Foundation, The Daily Beast)
references to specific news organizations and their content
New Auto-Interp
Negative Logits
worldly
-0.76
gone
-0.67
MLG
-0.67
ogun
-0.66
Ö¼
-0.65
actionDate
-0.63
Bulg
-0.62
abolic
-0.62
gom
-0.61
py
-0.59
POSITIVE LOGITS
NEWS
0.93
£ı
0.87
Gazette
0.85
sylv
0.80
Newsp
0.80
Digest
0.78
Truth
0.76
lishes
0.72
NETWORK
0.72
News
0.70
Activations Density 0.033%