INDEX
    Explanations

    expressions of appreciation or gratitude

    New Auto-Interp
    Negative Logits
    ing
    -0.86
    w
    -0.76
    ings
    -0.74
    ING
    -0.73
    ness
    -0.71
    est
    -0.71
    RLock
    -0.68
    c
    -0.67
    iness
    -0.67
    miary
    -0.65
    POSITIVE LOGITS
     myſelf
    1.19
     Forumite
    1.07
     themſelves
    1.07
     itſelf
    1.03
     ſeveral
    0.97
    uate
    0.95
    ^(@)
    0.93
     poffible
    0.92
    unate
    0.90
     himſelf
    0.90
    Act Density 0.342%

    No Known Activations