INDEX
Explanations
references to news agencies or press outlets
mentions of the Associated Press
New Auto-Interp
Negative Logits
arily
-0.68
imm
-0.63
underpin
-0.61
lessly
-0.60
dissolved
-0.58
rainbow
-0.56
gone
-0.56
horr
-0.56
peror
-0.56
sucker
-0.55
POSITIVE LOGITS
Press
1.39
Newsp
1.02
Journals
1.01
PRESS
0.93
States
0.79
â̦)
0.76
Nations
0.76
Franch
0.76
Journalists
0.74
Press
0.74
Activations Density 0.023%