    Negative Logits
    Token             Logit
    " combo"          -0.07
    "zheimer"         -0.07
    " Zion"           -0.07
    " Tradition"      -0.07
    " کمک"            -0.07
    "erialization"    -0.06
    " Combo"          -0.06
    " removal"        -0.06
    " Computing"      -0.06
    "bp"              -0.06
    Positive Logits
    Token             Logit
    " OSS"            0.07
    "ρ"               0.07
    [missing]         0.06
    ":`"              0.06
    " yasak"          0.06
    "РО"              0.06
    " subs"           0.06
    " 보여"            0.06
    "ARS"             0.06
    " Ağustos"        0.06
    Act Density: 0.001%

    No Known Activations