INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     tàu
    -0.06
    Routine
    -0.06
    ropsych
    -0.06
     Emotional
    -0.06
     Shared
    -0.06
     які
    -0.06
     lưới
    -0.06
     interpersonal
    -0.06
     Syrians
    -0.06
     WHO
    -0.06
    POSITIVE LOGITS
    nett
    0.07
    (class
    0.07
    .isBlank
    0.07
    .Drop
    0.07
     impost
    0.07
    _OID
    0.07
    طب
    0.06
     smirk
    0.06
    SID
    0.06
    IFI
    0.06
    Act Density 0.030%

    No Known Activations