INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     Tek
    -0.08
     tek
    -0.08
     downstairs
    -0.08
     Roller
    -0.08
     Embedded
    -0.08
     bawah
    -0.07
     adr
    -0.07
     Hoffman
    -0.07
     geri
    -0.07
    הר
    -0.07
    POSITIVE LOGITS
    ney
    0.08
    nyi
    0.08
    ার্থ
    0.07
    {\
    0.07
    ifiable
    0.07
     rejo
    0.07
    ders
    0.07
    voy
    0.07
     squ
    0.07
    weging
    0.07
    Act Density 0.004%

    No Known Activations