INDEX
Explanations
topics related to gun control and LGBTQ+ rights
New Auto-Interp
Negative Logits
izer
-0.28
ized
-0.26
ization
-0.25
eer
-0.24
ously
-0.24
izers
-0.24
ize
-0.23
naire
-0.22
hip
-0.20
aires
-0.20
POSITIVE LOGITS
ãĢħ
0.20
ery
0.18
istry
0.17
shot
0.17
//{{0.17
iness
0.17
yb
0.17
tober
0.16
rey
0.16
linger
0.16
Activations Density 0.638%