INDEX
Explanations
phrases related to articles or submissions, possibly with a call to action or donation request
instances of strong emotional expressions and reactions
Negative Logits
courier          -0.70
Aval             -0.69
herself          -0.68
intended         -0.67
escription       -0.67
intend           -0.66
sacked           -0.65
uninterrupted    -0.65
estranged        -0.64
compr            -0.64
Positive Logits
Anyway           1.46
Anyway           1.10
Seriously        1.07
Advertisement    1.07
Consider         1.01
Unless           0.98
Here             0.95
Of               0.94
fuck             0.94
Speaking         0.93
Activations Density: 0.661%