Explanations
verbs related to instigating actions or reactions
terms related to provocation and its consequences
Negative Logits
oard        -0.75
Accounting  -0.74
aver        -0.71
aum         -0.71
league      -0.70
ultz        -0.70
ayer        -0.69
ithing      -0.69
olitan      -0.67
amacare     -0.67
Positive Logits
provocation  1.25
provoke      1.14
provoking    1.02
provoked     0.95
provocative  0.91
xual         0.90
bystanders   0.90
prov         0.81
laughter     0.80
prompt       0.80
Activations Density: 0.018%