INDEX
    Explanations

    references to groups of people, especially in contexts involving students or youth

    New Auto-Interp
    Negative Logits
    chter
    -0.17
    ACS
    -0.15
     αÏĨ
    -0.14
    .sax
    -0.14
    ãĥ³ãĥĢ
    -0.14
    -os
    -0.14
    stub
    -0.14
     Hern
    -0.13
    kenin
    -0.13
     mour
    -0.13
    POSITIVE LOGITS
    aret
    0.17
    ozor
    0.16
    Rpc
    0.15
    zik
    0.15
    olls
    0.15
     Charges
    0.14
    лива
    0.14
    _WM
    0.13
    NAS
    0.13
    rollable
    0.13
    Act Density 0.262%

    No Known Activations