INDEX
Explanations
political accusations and denials
New Auto-Interp
Negative Logits
TagMode
-0.84
ArgsConstructor
-0.74
audiovisuel
-0.73
$.
-0.72
"},
-0.70
-0.69
zzleHttp
-0.68
]")]
-0.66
++
-0.66
`,
-0.66
POSITIVE LOGITS
rebuttal
0.57
claims
0.55
denies
0.53
accusations
0.52
defends
0.52
disagrees
0.51
defended
0.51
defense
0.51
disputed
0.50
Response
0.48
Activations Density 0.185%