INDEX
    Explanations

    work and effort

    New Auto-Interp
    Negative Logits
    brtc
    -0.07
    NP
    -0.07
    vět
    -0.07
     tří
    -0.06
    .ss
    -0.06
    selector
    -0.06
    Cb
    -0.06
    _graphics
    -0.06
    /services
    -0.06
    _pl
    -0.06
    POSITIVE LOGITS
     incorrectly
    0.07
     edilen
    0.06
     artist
    0.06
    0.06
     induction
    0.06
    0.06
    AGO
    0.06
    'utilisateur
    0.06
    uction
    0.06
    /↵
    0.06
    Act Density 0.015%

    No Known Activations