INDEX
    Explanations

    expressions of gratitude and appreciation

    Negative Logits
    Token          Logit
    " thank"       -0.29
    " Thank"       -0.27
    " thanking"    -0.26
    " thanked"     -0.24
    "Thank"        -0.23
    "thank"        -0.23
    " Thanks"      -0.22
    " thanks"      -0.22
    " THANK"       -0.22
    "Thanks"       -0.19
    Positive Logits
    Token              Logit
    " appreciated"     0.32
    " appreciate"      0.31
    " Apprec"          0.28
    " appreciation"    0.25
    " apprec"          0.19
    "ToOne"            0.16
    "uger"             0.16
    "áp"               0.15
    "olist"            0.15
    "OMUX"             0.14
    Act Density 0.063%
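
    The page does not define activation density. One common reading, assumed here, is the fraction of sampled token positions on which the feature fires at all. The sketch below illustrates that reading only; the function name `activation_density`, the `threshold` parameter, and the sample data are hypothetical and are not taken from this page.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds threshold.

    Assumption: "density" means the share of positions where the feature is
    active, not an average of activation strength.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 10,000 token positions, 6 of which are active.
sample = [0.0] * 9994 + [1.2, 0.4, 3.1, 0.9, 2.2, 0.7]
density = activation_density(sample)
print(f"{density:.3%}")
```

    Under this reading, a density of 0.063% would mean the feature is active on roughly 6 of every 10,000 tokens, which fits a narrow feature tied to a small family of tokens.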

    No Known Activations