INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    DEST
    -0.07
     گفته
    -0.06
    /editor
    -0.06
    favicon
    -0.06
    Scope
    -0.06
     Pang
    -0.06
     Walter
    -0.06
     lash
    -0.06
     condoms
    -0.06
    Allocator
    -0.06
    POSITIVE LOGITS
    819
    0.08
     شع
    0.07
    -Speed
    0.07
    ,ev
    0.07
    avir
    0.06
    ("^
    0.06
    _proto
    0.06
    ูท
    0.06
    λλα
    0.06
    สล
    0.06
    Act Density 0.000%

    No Known Activations