INDEX
Explanations
phrases related to societal or political issues and controversies
New Auto-Interp
Negative Logits
aceae
-0.90
igmatic
-0.77
è£
-0.71
lio
-0.70
rique
-0.67
utical
-0.67
sburg
-0.67
llular
-0.65
onder
-0.64
channelAvailability
-0.64
POSITIVE LOGITS
unfair
0.96
inaction
0.92
perceived
0.90
criticism
0.88
injustice
0.87
criticisms
0.87
accusations
0.86
questioning
0.83
suggestions
0.82
injust
0.81
Activations Density 2.702%