INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    //
    -0.64
     surla
    -0.63
    LookAnd
    -0.60
    ChildScrollView
    -0.57
    protoimpl
    -0.57
     Paglinawan
    -0.56
    ToBounds
    -0.56
    iczne
    -0.55
    colgroup
    -0.55
    gents
    -0.54
    POSITIVE LOGITS
     vaisselle
    0.50
     épaules
    0.47
    SPATH
    0.47
    ěstí
    0.46
    nalité
    0.46
     affich
    0.44
    hql
    0.44
     colorés
    0.43
     artikkelen
    0.43
    بوابة
    0.41
    Act Density 0.002%

    No Known Activations