Explanations
manipulative or persuasive language
Negative Logits
mybatisplus      -0.83
تضيفلها          -0.76
useRouter        -0.75
']))             -0.72
jenner           -0.71
NUMX             -0.70
HECK             -0.67
virons           -0.67
RTDA             -0.66
первых           -0.66
Positive Logits
↵↵               0.51
yyj              0.46
!!!”             0.45
</em>            0.44
begged           0.43
perdon           0.43
</blockquote>    0.41
”                0.41
I                0.41
Appeal           0.41
Activations Density: 0.274%