INDEX
    Explanations

    references to organizations, political events, and actions

    New Auto-Interp
    Negative Logits
    fordable
    -0.63
     intersper
    -0.62
     disreg
    -0.60
     downvotes
    -0.59
     lorenzo
    -0.59
    hmmmm
    -0.59
     javier
    -0.58
     impra
    -0.57
     encomp
    -0.57
     eyel
    -0.57
    POSITIVE LOGITS
     akut
    0.58
    because
    0.57
     antik
    0.55
     because
    0.55
     optik
    0.55
    unless
    0.54
     altogether
    0.52
    DataSnapshot
    0.52
    Pozdrawiam
    0.52
    ModelForm
    0.51
    Act Density 0.662%

    No Known Activations