Explanations
phrases related to denial or refutation
references to denials or claims made by individuals or entities
Negative Logits
æĹ                      -0.78
hei                     -0.76
wisely                  -0.76
Visual                  -0.75
udder                   -0.73
must                    -0.72
eps                     -0.72
acent                   -0.71
iour                    -0.71
ãĤµãĥ¼ãĥĨãĤ£ãĥ¯ãĥ³    -0.70
Positive Logits
allegation      1.76
allegations     1.59
accusation      1.52
accusations     1.39
complaint       1.23
investigation   1.20
authenticity    1.17
discrepancy     1.17
dossier         1.16
incident        1.16
Activations Density: 0.288%