INDEX
    Explanations

    expressions of gratitude and appreciation

    references to gratitude and appreciation towards others

    New Auto-Interp
    Negative Logits
    enum
    -0.74
     defaults
    -0.70
    ageddon
    -0.68
    ptive
    -0.68
     2030
    -0.65
    Elsewhere
    -0.65
     modifier
    -0.65
     Worse
    -0.64
    ivably
    -0.64
     dystop
    -0.63
    POSITIVE LOGITS
     kindness
    1.28
     generosity
    1.28
     invaluable
    1.27
     gracious
    1.21
     generous
    1.15
     kindly
    1.10
     generously
    1.09
     professionalism
    1.07
     dedication
    1.02
     thoughtful
    1.02
    Act Density 1.341%

    No Known Activations