INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    rw
    -0.07
    yw
    -0.07
     mM
    -0.07
    119
    -0.07
     temper
    -0.06
     dataArray
    -0.06
    (provider
    -0.06
    _training
    -0.06
     немає
    -0.06
    alphabet
    -0.06
    POSITIVE LOGITS
     Click
    0.11
     click
    0.11
    .click
    0.10
    click
    0.10
    Click
    0.09
     clicks
    0.08
     clicked
    0.08
    ButtonClick
    0.08
     clicking
    0.08
     Touch
    0.07
    Act Density 0.021%

    No Known Activations