INDEX
Explanations
references to societal issues related to violence and the justice system
New Auto-Interp
Negative Logits
now
-0.07
ager
-0.06
daily
-0.06
Currently
-0.06
currently
-0.06
ais
-0.06
nightly
-0.06
ih
-0.06
hala
-0.06
currently
-0.06
POSITIVE LOGITS
usually
0.18
often
0.18
sometimes
0.16
often
0.15
usually
0.15
Often
0.14
Often
0.14
ometimes
0.13
Usually
0.13
sometimes
0.13
Activations Density 0.011%