INDEX
Explanations
instances of violence or conflict
New Auto-Interp
Negative Logits
DebuggerStep
-0.62
aksanaan
-0.54
AccessorTable
-0.53
Tembelea
-0.53
ivelany
-0.51
stanga
-0.50
utafitiHapana
-0.49
EconPapers
-0.49
skak
-0.48
indl
-0.48
POSITIVE LOGITS
mortar
0.83
shelling
0.71
artillery
0.71
clashes
0.71
mortars
0.71
rocket
0.69
Mortar
0.67
rockets
0.67
airst
0.64
firing
0.64
Activations Density 0.258%