INDEX
    Explanations

    statements expressing feelings of violation or distress related to personal rights or safety

    New Auto-Interp
    Negative Logits
     sort
    -0.20
     terrific
    -0.16
    sort
    -0.15
     pars
    -0.15
    -redux
    -0.15
     sorts
    -0.15
     interesting
    -0.14
     fewer
    -0.14
    unes
    -0.14
    acer
    -0.14
    POSITIVE LOGITS
    691
    0.15
     sir
    0.15
     Freem
    0.15
    ionario
    0.15
    .obtain
    0.14
     ÙħتÙĨ
    0.14
    _vect
    0.14
    vasion
    0.14
     terminology
    0.14
     Sir
    0.14
    Act Density 0.072%

    No Known Activations