INDEX
    Explanations

    words associated with kindness and positive attributes

    New Auto-Interp
    Negative Logits
     NSCoder
    -0.88
    IsContent
    -0.87
    makeConstraints
    -0.77
     Drapeau
    -0.75
    DockStyle
    -0.71
    rrggbb
    -0.69
    GMENT
    -0.67
     egna
    -0.66
    AddTagHelper
    -0.65
    KommentareTeilen
    -0.64
    POSITIVE LOGITS
     kindness
    1.16
    kindness
    0.97
     Kindness
    0.96
     generosity
    0.90
     generous
    0.84
     kindly
    0.83
     charitable
    0.80
     unkind
    0.79
     compassionate
    0.76
     kinder
    0.71
    Act Density 0.362%

    No Known Activations