    Explanations

    expressions related to approval or agreement

    Negative Logits
     propOrder          -0.85
     '\\;'              -0.75
    AndEndTag           -0.71
     ↓,                 -0.67
     ModelExpression    -0.67
     الحره              -0.66
     numberOfRows       -0.65
     typelib            -0.64
    脚注の使い方          -0.64
     condol             -0.63
    Positive Logits
     consented          2.94
     consenting         1.81
     consents           1.34
    jenner              0.46
     vore               0.45
     مشين               0.45
    #                   0.44
    pushFollow          0.44
     Antrags            0.44
    weisung             0.43
    Activation Density: 0.001%

    No Known Activations