INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     conect
    -0.07
    GetY
    -0.07
     Ipsum
    -0.06
     انقل
    -0.06
     особист
    -0.06
    nosti
    -0.06
    xs
    -0.06
    -0.06
    Logout
    -0.06
     Ronnie
    -0.06
    POSITIVE LOGITS
     Indicator
    0.06
    earable
    0.06
    fighter
    0.06
    requently
    0.06
     Dreams
    0.06
    łe
    0.06
    -context
    0.06
    :indexPath
    0.06
    313
    0.06
    MAND
    0.06
    Act Density 0.015%

    No Known Activations