INDEX
    Explanations

    quantifiers and indicators of quantity or proportions

    New Auto-Interp
    Negative Logits
    kara
    -0.17
    /fw
    -0.15
    ROTO
    -0.15
    extended
    -0.15
    ä¼Ĺ
    -0.14
    iaux
    -0.14
    heim
    -0.14
    etty
    -0.14
    çľ¾
    -0.14
     incons
    -0.14
    POSITIVE LOGITS
    uan
    0.17
    cken
    0.15
    ãĥĥãĤ«ãĥ¼
    0.15
    ikhail
    0.14
     including
    0.14
    -thumbnails
    0.14
    ErrorCode
    0.14
     cal
    0.14
     Wan
    0.14
     Nose
    0.13
    Act Density 0.021%

    No Known Activations