INDEX
    Explanations

    words and phrases related to actions and performances

    New Auto-Interp
    Negative Logits
    someone
    -0.22
     someone
    -0.21
     somebody
    -0.20
    ä¸Ģ个人
    -0.20
     sebuah
    -0.20
     Someone
    -0.19
     alguien
    -0.18
    si
    -0.17
    Someone
    -0.17
    htar
    -0.15
    POSITIVE LOGITS
     quite
    0.23
     such
    0.21
     Quite
    0.20
    quite
    0.18
     SUCH
    0.18
     amore
    0.17
     somewhat
    0.15
    pend
    0.14
    arg
    0.14
    [
    0.14
    Act Density 0.300%

    No Known Activations