INDEX
    Explanations

    phrases related to physical destruction and injury

    words associated with violence and destruction

    Negative Logits
    Token           Logit
    "eq"            -0.68
    " href"         -0.66
    "cest"          -0.63
    " Flavoring"    -0.63
    "tein"          -0.63
    "rosso"         -0.61
    "alone"         -0.59
    " phr"          -0.59
    "Sol"           -0.58
    "argon"         -0.58

    Positive Logits
    Token              Logit
    "iHUD"             0.79
    "tered"            0.67
    " anew"            0.66
    "stretched"        0.65
    " deteriorated"    0.65
    " hinges"          0.65
    "aeus"             0.65
    "arie"             0.62
    " Fargo"           0.60
    "hement"           0.59

    Act Density 0.715%

    No Known Activations