INDEX
Explanations
phrases related to responding aggressively or defensively in an argument or conflict
instances of confrontational responses or criticisms
New Auto-Interp
Negative Logits
ocument
-0.77
Transform
-0.77
minster
-0.72
ancial
-0.71
scope
-0.70
utical
-0.68
benef
-0.66
ãĤµãĥ¼ãĥĨãĤ£ãĥ¯ãĥ³
-0.64
soDeliveryDate
-0.64
inct
-0.64
POSITIVE LOGITS
accusing
0.91
criticism
0.85
questioning
0.84
critics
0.84
angrily
0.78
blaming
0.78
criticisms
0.77
mockery
0.76
commenters
0.75
critic
0.75
Activations Density 0.259%