INDEX
    Explanations

    import/export

    New Auto-Interp
    Negative Logits
    akes
    -0.09
     disple
    -0.08
     dizzy
    -0.08
    ake
    -0.08
     spaghetti
    -0.08
     teclado
    -0.07
     intre
    -0.07
    /tests
    -0.07
     costume
    -0.07
     tým
    -0.07
    POSITIVE LOGITS
     refinance
    0.08
     helper
    0.08
    -peer
    0.08
     നേര
    0.08
     rim
    0.08
     vux
    0.08
     сын
    0.08
    融资
    0.08
     bereit
    0.08
     borrower
    0.08
    Act Density 0.018%

    No Known Activations