INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     زد
    -0.08
     ingr
    -0.07
    ował
    -0.07
    .raises
    -0.07
    -0.07
    Pear
    -0.07
     postul
    -0.07
     absoluta
    -0.07
     مال
    -0.06
    -0.06
    POSITIVE LOGITS
    hat
    0.08
    initiative
    0.08
    yeed
    0.08
    Gay
    0.08
    Gate
    0.07
    านุ
    0.07
    .
    ↵↵
    0.07
    hatan
    0.07
    Hdr
    0.07
     Gurgaon
    0.07
    Act Density 0.077%

    No Known Activations