INDEX
    Explanations

    expressions of gratitude and recognition

    New Auto-Interp
    Negative Logits
    claimed
    -0.15
     maybe
    -0.15
     EITHER
    -0.14
    ëĭ´
    -0.14
     only
    -0.14
     Only
    -0.13
     ONLY
    -0.13
    Only
    -0.13
    ãģĹãģĭ
    -0.13
     either
    -0.13
    POSITIVE LOGITS
     everyone
    0.40
     everybody
    0.34
    everyone
    0.32
     Everyone
    0.25
    Everyone
    0.25
     those
    0.23
     countless
    0.20
     past
    0.19
    those
    0.19
     semua
    0.19
    Act Density 0.135%

    No Known Activations