INDEX
    Explanations

    references to societal issues related to violence and the justice system

    Negative Logits
    " now"          -0.07
    "ager"          -0.06
    " daily"        -0.06
    " Currently"    -0.06
    " currently"    -0.06
    "ais"           -0.06
    " nightly"      -0.06
    "ih"            -0.06
    " hala"         -0.06
    "currently"     -0.06
    Positive Logits
    " usually"      0.18
    " often"        0.18
    " sometimes"    0.16
    "often"         0.15
    "usually"       0.15
    " Often"        0.14
    "Often"         0.14
    "ometimes"      0.13
    " Usually"      0.13
    "sometimes"     0.13
    Act Density 0.011%

    No Known Activations