INDEX
    Explanations

    references to bachelor's degrees

    New Auto-Interp
    Negative Logits
    ib
    -0.17
     subs
    -0.17
    ný
    -0.15
    abis
    -0.15
     penalty
    -0.15
    ibu
    -0.14
    ibal
    -0.14
    ÑħÑĸд
    -0.14
    owe
    -0.14
    ip
    -0.14
    POSITIVE LOGITS
    itorio
    0.17
    endez
    0.15
    iez
    0.15
    üp
    0.15
    utter
    0.15
    ares
    0.15
    ENCHMARK
    0.15
    -ln
    0.15
    ιά
    0.14
    criptor
    0.14
    Act Density 0.006%

    No Known Activations