INDEX
Explanations
terms related to social justice and political discussions, particularly focusing on race and historical figures
mentions of political ideologies and discussions around social identity
New Auto-Interp
Negative Logits
util
-0.89
76561
-0.82
didnt
-0.81
´
-0.77
lol
-0.74
Clan
-0.73
whilst
-0.73
confir
-0.73
!!
-0.73
stating
-0.72
POSITIVE LOGITS
NPR
1.26
Slate
1.22
POLITICO
1.20
Brookings
1.14
HuffPost
1.12
Pulitzer
1.02
Economist
1.02
NPR
0.99
Smithsonian
0.99
Vox
0.97
Activations Density 0.459%