INDEX
    Explanations

    evaluative language regarding ethics and usefulness

    Positive adjectives

    positive adjectives followed by conjunctions

    New Auto-Interp
    Negative Logits
    AndEndTag
    -1.07
     Efq
    -1.03
    DeleteBehavior
    -1.00
    URLException
    -1.00
    expandindo
    -1.00
    AddTagHelper
    -0.98
     كومونز
    -0.95
    GEBURTSDATUM
    -0.95
    InjectAttribute
    -0.94
     Himo
    -0.94
    POSITIVE LOGITS
     for
    0.97
     in
    0.68
     to
    0.67
     against
    0.66
     when
    0.62
     if
    0.57
     on
    0.53
     during
    0.53
     at
    0.53
     here
    0.52
    Act Density 0.360%

    No Known Activations