INDEX
Explanations
references to news reports or articles mentioning certain events or claims
mentions of news reporting
Negative Logits
venge      -0.85
creen      -0.77
ophers     -0.75
ificial    -0.72
educ       -0.70
knife      -0.69
palate     -0.69
otin       -0.69
benefit    -0.69
egal       -0.68
Positive Logits
reports    0.90
reporting  0.80
è¦ļéĨĴ     0.78
ounces     0.75
Newsweek   0.74
Reporting  0.71
headlines  0.69
Reporter   0.69
Reports    0.69
Stories    0.69
Activations Density 0.054%
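
One entry in the positive logits, è¦ļéĨĴ, looks like a raw byte-level BPE token rather than readable text. The sketch below shows how such a token could be mapped back to text, assuming (this is not stated on the page) that the vocabulary uses the GPT-2 style byte-to-character table. The names bytes_to_unicode and decode_token are illustrative helpers, not part of any dashboard API.

```python
def bytes_to_unicode():
    """Build the byte -> printable-character table used by GPT-2 style
    byte-level BPE. ASSUMPTION: the tokens on this page use that table."""
    printable = (
        list(range(ord("!"), ord("~") + 1))
        + list(range(0xA1, 0xAC + 1))
        + list(range(0xAE, 0xFF + 1))
    )
    printable_set = set(printable)
    byte_values = printable[:]
    code_points = printable[:]
    offset = 0
    # Bytes without a printable form are shifted to code points 256 and up.
    for b in range(256):
        if b not in printable_set:
            byte_values.append(b)
            code_points.append(256 + offset)
            offset += 1
    return {b: chr(c) for b, c in zip(byte_values, code_points)}


# Inverse table: displayed character -> original byte value.
UNICODE_TO_BYTE = {ch: b for b, ch in bytes_to_unicode().items()}


def decode_token(token: str) -> str:
    """Turn a displayed byte-level token back into UTF-8 text."""
    raw = bytes(UNICODE_TO_BYTE[ch] for ch in token)
    return raw.decode("utf-8", errors="replace")


if __name__ == "__main__":
    # The unreadable positive-logit token from the list above.
    print(decode_token("\u00e8\u00a6\u013c\u00e9\u0128\u0134"))
    # Plain ASCII tokens pass through unchanged.
    print(decode_token("reports"))
```

Under that assumption, the token's six characters correspond to the bytes E8 A6 9A E9 86 92, which is the UTF-8 encoding of the Japanese word 覚醒. If the page's model uses a different tokenizer, the mapping does not apply.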