    Explanations

    sentences that express a sense of duty or responsibility toward helping others

    Negative Logits
    Diſ                -0.81
    CreateTagHelper    -0.79
    Conſ               -0.72
    Inſ                -0.72
    Houſe              -0.67
    Reſ                -0.67
    disambiguazione    -0.65
    >{@                -0.64
    NameInMap          -0.64
    uſed               -0.64

    Positive Logits
    Helping             0.60
    compassionate       0.57
    charities           0.55
    altru               0.55
    aiutare             0.53
    charitable          0.53
    charity             0.52
    donate              0.51
    repay               0.51
    drawSprites         0.51

    Activation Density: 0.273%

    No Known Activations