INDEX
    Explanations

    expressions of gratitude or thanks

    New Auto-Interp
    Negative Logits
    ince
    -0.15
    elsey
    -0.14
    ż
    -0.14
    aley
    -0.14
    apr
    -0.14
    alth
    -0.13
    ardo
    -0.13
    SSIP
    -0.13
    cert
    -0.13
    oval
    -0.13
    POSITIVE LOGITS
     so
    0.30
     again
    0.25
     very
    0.24
     bunch
    0.24
     much
    0.21
     everyone
    0.21
     heaps
    0.20
     beaucoup
    0.18
    again
    0.18
     tons
    0.18
    Act Density 0.012%

    No Known Activations