INDEX
    Explanations
    Negative Logits
     کول              -0.08
     davran           -0.07
    קים               -0.07
    ાયા               -0.07
     تصور             -0.07
    organizations     -0.07
     seemed           -0.07
     עבור             -0.07
     husband          -0.07
    Organizations     -0.07
    Positive Logits
     gens             0.08
    fps               0.08
                      0.08
     DDR              0.08
    rng               0.07
     crisp            0.07
     DP               0.07
    FPS               0.07
     acel             0.07
     скорость         0.07
    Act Density 0.001%

    No Known Activations