INDEX
Explanations
accusations and denials in text
phrases related to denial of accusations
Negative Logits
tnc       -0.81
ausp      -0.72
ibaba     -0.70
Sensor    -0.69
odes      -0.68
Measure   -0.66
ILCS      -0.65
rafted    -0.64
mosqu     -0.63
synergy   -0.63
Positive Logits
denies         1.43
denied         1.38
refuted        1.33
deny           1.32
retracted      1.30
vehemently     1.25
disav          1.24
denying        1.21
disputed       1.21
contradicted   1.19
Activations Density: 0.529%