INDEX
Explanations
references to violence and related topics, such as discussions, condemnation, and potential solutions
mentions of violence and its implications in various contexts
New Auto-Interp
Negative Logits
sonian
-0.89
ocular
-0.78
é£
-0.78
é¾įå
-0.77
ITNESS
-0.77
dit
-0.71
glas
-0.70
gres
-0.68
acle
-0.67
amina
-0.67
POSITIVE LOGITS
perpetrated
1.30
against
1.19
Against
1.07
inflicted
1.04
prevention
0.99
directed
0.98
against
0.98
committed
0.95
rained
0.93
toward
0.93
Activations Density 0.060%