INDEX
Explanations
statements of denial
instances of denial or rejection regarding claims or accusations
New Auto-Interp
Negative Logits
incinn
-0.93
enegger
-0.77
ARCH
-0.73
emetery
-0.73
Center
-0.69
clone
-0.69
Lex
-0.69
place
-0.67
GROUP
-0.67
Ranked
-0.66
POSITIVE LOGITS
denies
0.91
deny
0.84
outright
0.82
denied
0.77
denying
0.76
vehemently
0.74
extradition
0.73
excuses
0.73
wrongdoing
0.71
denial
0.71
Activations Density 0.016%