INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    URRE
    -0.07
    .neighbors
    -0.06
     آور
    -0.06
    venue
    -0.06
     affine
    -0.06
     deber
    -0.06
    -0.06
    .unique
    -0.06
    -*-
    -0.06
    angi
    -0.06
    POSITIVE LOGITS
     cloth
    0.14
     Rag
    0.10
     Cloth
    0.09
     rag
    0.09
    cloth
    0.07
    bestos
    0.07
    loth
    0.06
     mist
    0.06
     ceremon
    0.06
     bleach
    0.06
    Act Density 0.002%

    No Known Activations