INDEX
Explanations
words related to protests and activism
New Auto-Interp
Negative Logits
needles
-0.72
noodles
-0.69
inia
-0.69
gorge
-0.67
ponds
-0.63
spears
-0.63
Cornell
-0.62
worms
-0.62
ieri
-0.62
swall
-0.60
POSITIVE LOGITS
blems
1.30
gression
1.22
gressive
1.21
secut
1.19
ceed
1.18
posal
1.16
digy
1.12
secution
1.07
spect
1.05
hibited
1.04
Activations Density 0.011%