INDEX
    Explanations
    Negative Logits
    " شرطونه"  0.71
    " exponencial"  0.61
    "۰"  0.61
    " предотвра"  0.60
    " وړاندوینې"  0.60
    " someplace"  0.60
    "帳に追加"  0.58
    " caregivers"  0.58
    " cun"  0.57
    " дат"  0.57
    Positive Logits
    "yn"  0.83
    "as"  0.71
    "ä"  0.71
    "was"  0.70
    "ens"  0.68
    "س"  0.65
    "all"  0.63
    "ت"  0.62
    "last"  0.61
    "yan"  0.61
    Activation Density: 0.036%

    No Known Activations