INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    opies
    -0.07
     ISIS
    -0.07
    Cam
    -0.06
     gamers
    -0.06
    _services
    -0.06
    lectual
    -0.06
     плит
    -0.06
     persuaded
    -0.06
    ışık
    -0.06
    Monthly
    -0.06
    POSITIVE LOGITS
    egen
    0.07
    ----------------------------
    0.07
    0.06
    0.06
    erken
    0.06
    -bin
    0.06
    ._↵↵
    0.06
     accordance
    0.06
    рег
    0.06
    CompanyName
    0.06
    Act Density 0.079%

    No Known Activations