INDEX
Explanations
words and phrases indicating claims or statements of alleged events or circumstances
New Auto-Interp
Negative Logits
and
-0.58
I
-0.57
A
-0.52
We
-0.49
we
-0.48
DECREF
-0.47
or
-0.46
\\
-0.45
-
-0.44
li
-0.44
POSITIVE LOGITS
allegedly
1.11
parsedMessage
1.08
edly
1.07
supposedly
1.03
alleged
0.94
reportedly
0.94
purported
0.93
supposed
0.93
EDEFAULT
0.93
supuestamente
0.93
Activations Density 0.226%