INDEX
    Explanations

    manipulative or persuasive language

    Negative Logits
    mybatisplus    -0.83
     تضيفلها       -0.76
     useRouter     -0.75
    ']))           -0.72
     jenner        -0.71
    NUMX           -0.70
    HECK           -0.67
    virons         -0.67
    RTDA           -0.66
    первых         -0.66
    Positive Logits
    ↵↵              0.51
    yyj             0.46
    !!!”            0.45
    </em>           0.44
     begged         0.43
     perdon         0.43
    </blockquote>   0.41
                    0.41
     I              0.41
     Appeal         0.41
    Act Density 0.274%

    No Known Activations