INDEX
Explanations
names of news agencies and possibly associated types of content
references to the Associated Press
New Auto-Interp
Negative Logits
ment
-0.79
gaard
-0.66
stabil
-0.65
pants
-0.64
stood
-0.63
trout
-0.62
tty
-0.62
abiding
-0.61
Founders
-0.61
gluten
-0.60
POSITIVE LOGITS
PLIED
1.09
TN
1.01
PLIC
1.00
PLE
0.96
ocalypse
0.94
OE
0.89
PLA
0.88
CBC
0.87
rison
0.86
HAEL
0.86
Activations Density 0.011%