INDEX
    Explanations

    phrases related to helping and assisting others

    phrases referring to individuals or groups in need or facing challenges

    Negative Logits
    "kefeller"         -0.69
    "zeb"              -0.67
    " Alive"           -0.66
    " Slim"            -0.65
    " carnage"         -0.65
    " bloodshed"       -0.62
    " convincing"      -0.62
    "uphem"            -0.60
    " manslaughter"    -0.57
    " Mandal"          -0.56
    Positive Logits
    " might"           1.16
    " may"             1.11
    " otherwise"       1.09
    " want"            1.05
    " need"            1.01
    "might"            0.99
    " wish"            0.97
    "want"             0.97
    " wished"          0.96
    " rely"            0.96
    Act Density 0.213%

    No Known Activations