INDEX
    Explanations

    expressions of gratitude and appreciation

    expressions of gratitude and support toward individuals or groups

    New Auto-Interp
    Negative Logits
    ptive
    -0.82
    EStream
    -0.77
    ilibrium
    -0.77
    atten
    -0.77
    enum
    -0.77
    hex
    -0.69
    ilib
    -0.69
     dystop
    -0.68
    erenn
    -0.68
     dystopian
    -0.68
    POSITIVE LOGITS
     kindness
    1.29
     generosity
    1.26
     invaluable
    1.18
     generous
    1.15
     gracious
    1.13
     generously
    1.13
     professionalism
    1.10
     tirelessly
    1.09
     hospitality
    1.08
     persever
    1.02
    Act Density 0.980%

    No Known Activations