INDEX
    Explanations

    auxiliary verbs

    Negative Logits
    ुआ  -0.06
     pronunciation  -0.06
     careful  -0.06
     Thực  -0.06
     medication  -0.06
    ación  -0.06
    #print  -0.06
     más  -0.06
    ,她  -0.06
    えない  -0.06
    Positive Logits
     Takım  0.07
    0.06
     jinak  0.06
    CENT  0.06
     Gül  0.06
    ISC  0.06
    aycast  0.06
    -quarter  0.06
    ِين  0.06
     Hyde  0.06
    Act Density 0.166%

    No Known Activations