INDEX
Explanations
phrases related to escalating conflicts and tensions
phrases related to conflict and escalation
Negative Logits
Preservation         -0.76
unbiased             -0.72
enment               -0.71
natureconservancy    -0.70
Performance          -0.69
Correct              -0.67
Ħ¢                   -0.66
humility             -0.65
cius                 -0.65
optim                -0.64
Positive Logits
threatening          1.02
escalate             0.95
Viol                 0.95
aggravated           0.94
engulf               0.90
fury                 0.89
escalated            0.89
intensify            0.88
erupt                0.88
vitri                0.87
Activations Density: 0.759%