INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    endphp
    -0.90
     kit
    -0.90
    ی
    -0.84
     KIT
    -0.83
    er
    -0.82
    ه
    -0.81
     Kit
    -0.79
    sant
    -0.78
    KIT
    -0.76
     Trip
    -0.71
    POSITIVE LOGITS
    انجليز
    0.51
     ciasc
    0.51
    the
    0.50
     Connectez
    0.50
    ally
    0.50
     ouvertes
    0.50
     giapp
    0.48
     ordinaires
    0.48
    tt
    0.48
     suivantes
    0.48
    Act Density 1.588%

    No Known Activations