INDEX
    Explanations

    gratitude and well wishes

    New Auto-Interp
    Negative Logits
    :
    0.85
     egensk
    0.77
     hemorrh
    0.74
    存在する
    0.73
     epistem
    0.70
     obese
    0.70
     failures
    0.69
     terrified
    0.69
    最も
    0.68
    hematic
    0.68
    POSITIVE LOGITS
     thank
    1.31
     Thank
    1.29
     спасибо
    1.28
     kindly
    1.27
     Спасибо
    1.25
    Thank
    1.22
     lovely
    1.22
     graciously
    1.19
    Спасибо
    1.18
    ありがとうございます
    1.17
    Act Density 1.854%

    No Known Activations