INDEX
Explanations
publications or news outlets
proper nouns and names of organizations or entities
New Auto-Interp
Negative Logits
lain
-0.82
ãĤµãĥ¼ãĥĨãĤ£ãĥ¯ãĥ³
-0.78
ãĤ¼ãĤ¦ãĤ¹
-0.71
rongh
-0.70
à¨
-0.69
masc
-0.68
enegger
-0.67
����
-0.65
ç¥ŀ
-0.64
orsche
-0.62
POSITIVE LOGITS
Journalism
1.02
Examiner
1.01
Observer
1.00
Reporter
0.99
News
0.98
Newsp
0.96
Journal
0.96
journal
0.94
Report
0.92
News
0.91
Activations Density 0.258%