INDEX
Explanations
phrases related to various forms of violence and actions against violence
references to violence and related discussions
Negative Logits
Token      Logit
sonian     -0.85
ocular     -0.82
é¾įå       -0.75
heit       -0.75
dit        -0.74
arton      -0.73
acle       -0.71
osition    -0.68
oplan      -0.68
Folder     -0.67
Positive Logits
Token        Logit
perpetrated  1.20
inflicted    1.01
against      0.93
directed     0.91
prevention   0.89
committed    0.85
towards      0.85
erupted      0.82
Against      0.82
toward       0.81
Activations Density: 0.062%