INDEX
    Explanations

    expressions of gratitude and appreciation

    New Auto-Interp
    Negative Logits
     therefore
    -0.19
     wiÄĻc
    -0.16
     Bard
    -0.16
     Therefore
    -0.16
    Therefore
    -0.16
     donc
    -0.16
     Congratulations
    -0.14
     congratulations
    -0.14
     then
    -0.14
    .then
    -0.14
    POSITIVE LOGITS
    xea
    0.19
    endar
    0.18
     especially
    0.17
    appa
    0.17
    plies
    0.16
    roti
    0.15
     HOLDER
    0.15
    ÙĪÙĨÛĮ
    0.15
    alink
    0.15
    istrovstvÃŃ
    0.15
    Act Density 0.125%

    No Known Activations