INDEX
Explanations
news-related keywords
occurrences of the word "News" in various contexts
Negative Logits
actory     -0.71
itent      -0.71
otent      -0.70
ibr        -0.70
resent     -0.70
ading      -0.69
skilled    -0.68
ugh        -0.68
aded       -0.67
goddamn    -0.67
Positive Logits
News       1.25
NEWS       1.00
Articles   0.92
headlines  0.91
Content    0.88
News       0.88
orial      0.85
Coverage   0.85
Seym       0.82
Reporting  0.82
Activations Density 0.009%