INDEX
    Explanations

    requests for user feedback and engagement in the comments section

    New Auto-Interp
    Negative Logits
    ')):
    -0.66
    DriverManager
    -0.64
    cient
    -0.58
    __;
    -0.58
    ]};
    -0.58
    }\]
    -0.57
    ]='\
    -0.56
    Newswire
    -0.56
     recommandons
    -0.54
     resident
    -0.53
    POSITIVE LOGITS
    AndEndTag
    0.75
     commentaire
    0.66
    astify
    0.55
     feedbacks
    0.55
     bezeichneter
    0.53
     feedback
    0.52
     homophobic
    0.51
     فريبيس
    0.50
    giro
    0.50
     Penit
    0.49
    Act Density 0.120%

    No Known Activations