    Negative Logits
    "ْر"                  -0.06
    " ostream"            -0.06
    " mitt"               -0.06
    " rop"                -0.06
    " scrape"             -0.06
    " dipped"             -0.06
    " contradictory"      -0.06
    " mar"                -0.06
    " jPanel"             -0.06
    " Scientists"         -0.06
    Positive Logits
    " Ty"                 0.09
    " ty"                 0.08
    "истра"               0.07
    "aysia"               0.07
    "ty"                  0.07
    "isy"                 0.07
    "ftime"               0.06
    " бух"                0.06
    "일반"                0.06
    " Tacoma"             0.06
    Act Density 0.009%

    No Known Activations