INDEX
    Explanations

    phrases and conjunctions indicating recommendations or obligations

    New Auto-Interp
    Negative Logits
    addtogroup
    -0.16
    zdy
    -0.15
    _PED
    -0.15
    оваÑĢ
    -0.15
    ãĤīãģĽ
    -0.14
    raya
    -0.14
    ryn
    -0.14
    rack
    -0.14
    mlin
    -0.14
    337
    -0.14
    POSITIVE LOGITS
    vu
    0.15
    allet
    0.14
    hta
    0.14
    ovich
    0.14
     Kis
    0.14
    itter
    0.14
    à¤Ĥà¤Ł
    0.14
    lew
    0.13
    htags
    0.13
    oreal
    0.13
    Act Density 0.034%

    No Known Activations