INDEX
    Explanations

    phrases related to denial and rejection

    Negative Logits
     sre              -0.48
     both             -0.44
     رسیده            -0.44
     potenza          -0.44
    viron             -0.44
    Sinopsis          -0.44
     Kaufmann         -0.43
    épar              -0.43
    їн                -0.43
     Chid             -0.43
    Positive Logits
    Personensuche     1.10
    IntoConstraints   1.07
     مشين             0.91
    aarrggbb          0.90
    stdc              0.81
    TagMode           0.81
     []:              0.81
    AndEndTag         0.78
     OFDb             0.75
    ScopeManager      0.73
    Activation Density: 0.341%

    No Known Activations