INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     bitk
    -0.06
    ellation
    -0.06
    .isTrue
    -0.06
     С
    -0.06
    пеки
    -0.06
     FA
    -0.06
     oluş
    -0.06
    omid
    -0.06
    -0.06
     piping
    -0.06
    POSITIVE LOGITS
     drawer
    0.15
     drawers
    0.14
     Drawer
    0.10
    rawer
    0.10
    drawer
    0.09
    _drawer
    0.09
    Drawer
    0.08
    'er
    0.07
    href
    0.07
     Adler
    0.07
    Act Density 0.001%

    No Known Activations