INDEX
    Explanations

    Words expressing causal impact or influence, e.g. verbs such as “exacerbate,” “impede,” “encourage,” “hurt,” and “devalue.”

    Negative Logits
    Token         Logit
    " веч"        -0.06
    "ircular"     -0.06
    " solve"      -0.06
    "oor"         -0.06
    " такие"      -0.06
    "cers"        -0.06
    "CW"          -0.06
    "imated"      -0.06
    " visas"      -0.06
    " dolls"      -0.06
    Positive Logits
    Token         Logit
    ".Ge"          0.07
    "/business"    0.07
    " haciendo"    0.07
    " ек"          0.06
    " oppon"       0.06
    " воно"        0.06
    " shipment"    0.06
    "Exchange"     0.06
    "ющие"         0.06
    " jiných"      0.06
    Activation Density: 0.096%

    No Known Activations