INDEX
    Explanations

    Portuguese language

    Negative Logits
    Token           Logit
    " reviewed"     -0.08
    " assault"      -0.08
                    -0.08
    "account"       -0.07
    " certain"      -0.07
    " shotgun"      -0.07
    "试点"          -0.07
    "ual"           -0.07
    " attempt"      -0.07
    "转运"          -0.07
    Positive Logits
    Token           Logit
    " Ele"          0.09
    " geme"         0.08
    " curved"       0.08
    " Fle"          0.07
    "_Se"           0.07
    " המכ"          0.07
    "Ele"           0.07
    " ele"          0.07
    " msec"         0.07
    " discrimin"    0.07
    Activation Density: 0.012%

    No Known Activations