INDEX
Explanations
mentions of historical figures and movements related to social justice and activism
New Auto-Interp
Negative Logits
Bieber
-0.16
LEV
-0.16
ichen
-0.15
unga
-0.15
ASE
-0.15
ÙĨص
-0.15
IMIT
-0.15
Frontier
-0.14
ovation
-0.14
/pi
-0.14
POSITIVE LOGITS
Panther
0.35
Panthers
0.34
Hue
0.28
pan
0.23
Malcolm
0.22
NOI
0.22
Pan
0.22
bpp
0.22
Mum
0.21
Pan
0.21
Activations Density 0.049%