INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    _btn
    -0.07
     dette
    -0.07
    _information
    -0.07
     acre
    -0.06
    *ft
    -0.06
     gating
    -0.06
     زر
    -0.06
     shows
    -0.06
     Cached
    -0.06
     bản
    -0.06
    POSITIVE LOGITS
    .commit
    0.14
    Date
    0.06
     JFK
    0.06
    uds
    0.06
     Reddit
    0.06
    owell
    0.06
    umar
    0.06
     Toe
    0.06
    gesture
    0.06
    0.06
    Act Density 0.001%

    No Known Activations