INDEX
Explanations
words related to news and reporting
mentions of news
New Auto-Interp
Negative Logits
sei
-0.79
agra
-0.78
utters
-0.73
warm
-0.72
erect
-0.72
aughs
-0.71
asse
-0.70
heed
-0.70
lled
-0.68
vae
-0.67
POSITIVE LOGITS
headlines
1.01
news
0.96
NEWS
0.91
worthy
0.91
worthiness
0.82
News
0.82
reader
0.80
cannabin
0.79
orial
0.78
Coverage
0.78
Activations Density 0.032%