INDEX
    NEGATIVE LOGITS
    iações        -0.08
     indivíduos   -0.08
     envel        -0.08
     зрел         -0.07
     opinions     -0.07
    uawei         -0.07
    ുല            -0.07
     aufgrund     -0.07
    보고            -0.07
     henkil       -0.07
    POSITIVE LOGITS
     pár          0.08
    gs            0.08
    Α             0.07
    și            0.07
     bele         0.07
    posts         0.07
     trouble      0.07
    GS            0.07
    -to           0.07
     Student      0.07
    Activation Density: 0.077%

    No Known Activations