INDEX
    Explanations

    words related to violent criminal activities, specifically murder

    words associated with murder or violent acts

    Negative Logits
    Token             Logit
    " Tibetan"        -0.65
    "OHN"             -0.65
    " Pixie"          -0.65
    "AU"              -0.64
    " Shinra"         -0.64
    " Fundamental"    -0.64
    " Shining"        -0.62
    "orers"           -0.59
    " Hercules"       -0.58
    " Palo"           -0.58

    Positive Logits
    Token             Logit
    "mur"              1.38
    "ãĥ£"              0.94
    "boats"            0.88
    "Mur"              0.86
    "geon"             0.86
    "boat"             0.86
    "izont"            0.84
    "cloth"            0.83
    "amaz"             0.82
    "chief"            0.82

    Activation Density: 0.005%

    No Known Activations