INDEX
Explanations
news agency names
references to the news organization Reuters
New Auto-Interp
Negative Logits
gran
-0.80
ysis
-0.72
icular
-0.71
alities
-0.69
iencies
-0.69
gone
-0.63
constants
-0.62
Redditor
-0.61
oven
-0.59
flo
-0.58
POSITIVE LOGITS
Reuters
0.87
Money
0.77
News
0.76
ournal
0.73
+)
0.73
Associated
0.72
)—
0.72
PLIED
0.71
agascar
0.71
CF
0.71
Activations Density 0.006%