INDEX
    Explanations

    the presence of specific phonetic patterns or sequences in words

    New Auto-Interp
    Negative Logits
    kaç
    -0.15
    chooser
    -0.15
    ÄĻż
    -0.14
    uela
    -0.14
    atype
    -0.14
    itre
    -0.14
    liš
    -0.14
    rani
    -0.14
    losures
    -0.13
    webtoken
    -0.13
    POSITIVE LOGITS
    edList
    0.14
    arpa
    0.14
    ACTER
    0.13
    ffer
    0.13
     dear
    0.13
    oire
    0.13
    standing
    0.13
    ë³µ
    0.13
    edian
    0.13
     standing
    0.13
    Act Density 0.295%

    No Known Activations