    Negative Logits
     crawler  -0.07
    ponsor  -0.06
    emade  -0.06
    _AGENT  -0.06
    onent  -0.06
    .Meta  -0.06
    lied  -0.06
     artır  -0.06
    ptom  -0.06
    OID  -0.06
    Positive Logits
     (%  0.07
    (sz  0.06
     اس  0.06
     executing  0.06
    neutral  0.06
     همه  0.06
    کنون  0.06
     prz  0.06
     Lottery  0.06
     Michele  0.06
    Activation Density: 0.014%

    No Known Activations