INDEX
Explanations
references to the Black Panther Party and related terms
references to the Black Panther Party and related social justice topics
New Auto-Interp
Negative Logits
laus
-0.82
fare
-0.75
gob
-0.72
urus
-0.72
vere
-0.71
forcing
-0.68
isen
-0.67
ve
-0.67
vers
-0.67
bane
-0.66
POSITIVE LOGITS
Manifest
0.74
resents
0.73
ONSORED
0.72
NAACP
0.71
Panther
0.71
iosyncr
0.70
ribution
0.69
ribut
0.68
NING
0.67
glim
0.65
Activations Density 0.018%