INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     примен
    -0.08
     преимущества
    -0.08
    hik
    -0.07
     selling
    -0.07
     pien
    -0.07
    амо
    -0.07
     tla
    -0.07
    ична
    -0.07
     sms
    -0.07
     помощ
    -0.07
    POSITIVE LOGITS
     dodatk
    0.08
    0.07
     Conselho
    0.07
    (',',
    0.07
    Fun
    0.07
    õe
    0.07
     reinforcement
    0.07
    udge
    0.07
     mellett
    0.07
    (Color
    0.07
    Act Density 0.003%

    No Known Activations