INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    -zero
    -0.07
     سازی
    -0.06
     boasted
    -0.06
     valeur
    -0.06
    idebar
    -0.06
     Za
    -0.06
    .exists
    -0.06
    .pitch
    -0.06
     setbacks
    -0.06
     neglig
    -0.06
    POSITIVE LOGITS
     CORPOR
    0.07
    tips
    0.07
    SPELL
    0.07
    forecast
    0.06
     Forg
    0.06
    Recogn
    0.06
    Congress
    0.06
    fo
    0.06
    ...]
    0.06
     focus
    0.06
    Act Density 0.003%

    No Known Activations