INDEX
Explanations
references to a specific news outlet, Breitbart News
references to the news outlet Breitbart
New Auto-Interp
Negative Logits
phis
-0.94
ktop
-0.87
mble
-0.79
awks
-0.79
captcha
-0.78
incent
-0.77
araoh
-0.75
Gael
-0.74
aredevil
-0.74
uay
-0.71
POSITIVE LOGITS
itbart
0.92
News
0.82
NEWS
0.77
Talk
0.74
Centauri
0.73
Breitbart
0.73
Found
0.70
Books
0.70
istas
0.69
News
0.68
Activations Density 0.009%