INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     احساس
    -0.07
    -0.06
     ار
    -0.06
    Cube
    -0.06
    zcze
    -0.06
    bagai
    -0.06
    ersion
    -0.06
    -0.06
    یس
    -0.06
    unan
    -0.06
    POSITIVE LOGITS
    charg
    0.07
     Lump
    0.07
     Gall
    0.06
     Peripheral
    0.06
     Dodge
    0.06
    :both
    0.06
    mızı
    0.06
     issue
    0.06
    layers
    0.06
     Remed
    0.06
    Act Density 0.058%

    No Known Activations