INDEX
    Explanations

    expressions related to personal experiences and achievements

    expressions of gratitude and appreciation

    New Auto-Interp
    Negative Logits
    asury
    -0.60
     Officials
    -0.60
    aspers
    -0.60
    bsp
    -0.60
     Submit
    -0.59
    ãĥĥãĥĪ
    -0.58
    ãĥķãĤ¡
    -0.57
    umo
    -0.56
    PIN
    -0.56
    ãĥĥãĤ¯
    -0.55
    POSITIVE LOGITS
     sucks
    1.39
     helps
    1.38
     reminds
    1.36
     means
    1.36
     wasn
    1.33
     makes
    1.29
     entails
    1.29
     inspires
    1.28
     meant
    1.28
     gives
    1.28
    Act Density 0.442%

    No Known Activations