    Negative Logits
    Token         Logit
    "profil"      -0.07
    "-beta"       -0.07
    " ```"        -0.07
    " boz"        -0.06
    "pieces"      -0.06
    " optics"     -0.06
    " BMI"        -0.06
    "Acts"        -0.06
    "trand"       -0.06
    " devices"    -0.06
    Positive Logits
    Token              Logit
    [token missing]    0.07
    [token missing]    0.06
    " Paypal"          0.06
    " موفق"            0.06
    " volver"          0.06
    "Backing"          0.06
    [token missing]    0.06
    [token missing]    0.06
    "sein"             0.06
    ".vx"              0.05
    Activation Density: 0.007%

    No Known Activations