INDEX
    Explanations

    references to academic publications and reports

    New Auto-Interp
    Negative Logits
     receptions
    -0.16
    oÄŁlu
    -0.15
     despite
    -0.15
    igor
    -0.15
     and
    -0.15
     next
    -0.15
     to
    -0.15
     or
    -0.14
    ¶Į
    -0.14
    fone
    -0.14
    POSITIVE LOGITS
    ģn
    0.16
     pp
    0.15
     STATS
    0.14
    _except
    0.14
     fod
    0.14
     Ñģна
    0.14
    086
    0.13
     vol
    0.13
    ym
    0.13
    YP
    0.13
    Act Density 0.239%

    No Known Activations