INDEX
    Explanations

    expressions related to assistance or support

    Negative Logits
    rrbracket    -0.68
     منها    -0.65
    }$​    -0.62
     רבה    -0.61
    Πηγές    -0.59
    әне    -0.58
     يعد    -0.58
    numerusform    -0.58
    டன்    -0.57
    表示    -0.57
    Positive Logits
    󠁧    0.73
     będ    0.72
    addCriterion    0.70
    دانشنامهٔ    0.70
     derni    0.69
     Slu    0.69
    istoitu    0.68
     Ptole    0.67
    PARTIC    0.65
     Dism    0.65
    Activation Density: 0.286%

    No Known Activations