Explanations
news-related terms or phrases in titles
references to daily news updates and summaries
Negative Logits
istically   -0.65
ably        -0.62
ĸļ          -0.62
naire       -0.61
alties      -0.60
acca        -0.59
concession  -0.58
forth       -0.57
exception   -0.56
ibly        -0.55
Positive Logits
headlines        0.73
articles         0.67
stories          0.64
celeb            0.62
adish            0.62
(no token text)  0.62
hor              0.60
news             0.60
umbnails         0.59
aily             0.57
Activations Density 0.045%