INDEX
Explanations
significant quotes or statements about social justice and responsibility
New Auto-Interp
Negative Logits
sworth
-0.16
inosaur
-0.15
afari
-0.15
ynn
-0.14
ALE
-0.14
sert
-0.14
ernels
-0.14
ngo
-0.14
leme
-0.14
èĩªæĭį
-0.14
POSITIVE LOGITS
fol
0.18
folks
0.17
America
0.17
Americans
0.17
Fol
0.16
akin
0.16
Freel
0.15
marsh
0.15
-Americ
0.15
ìĦŃ
0.14
Activations Density 0.053%