INDEX
    Explanations

    expressions of well-wishing and positivity

    New Auto-Interp
    Negative Logits
     thanks
    -0.78
    thanks
    -0.73
     THANKS
    -0.73
     thank
    -0.71
     Thanks
    -0.69
    Thanks
    -0.69
     gracias
    -0.68
     takk
    -0.67
     thankful
    -0.64
     Thx
    -0.63
    POSITIVE LOGITS
    rungsseite
    0.86
    
    0.71
    
    0.66
    Portail
    0.62
    NameInMap
    0.57
    kháu
    0.56
    HtmlAttribute
    0.56
     TestBed
    0.55
    المناصب
    0.55
    CPtr
    0.55
    Act Density 0.025%

    No Known Activations