INDEX
    Explanations

    specific suffix patterns in words, particularly those ending with -e or -ple

    New Auto-Interp
    Negative Logits
    localctx
    -0.58
     članak
    -0.57
     Darum
    -0.53
    出版年
    -0.52
    INARY
    -0.51
    DEF
    -0.50
     vPvB
    -0.49
     województwie
    -0.48
    iles
    -0.47
    jini
    -0.46
    POSITIVE LOGITS
    ath
    0.64
    ats
    0.63
    atst
    0.59
     تضيفلها
    0.58
    aling
    0.52
    alth
    0.51
    aten
    0.51
    asun
    0.50
    ATS
    0.50
    ather
    0.50
    Act Density 0.325%

    No Known Activations