INDEX
Explanations
references to civilian casualties and the implications of warfare
New Auto-Interp
Negative Logits
Needle
-0.15
andel
-0.14
çį
-0.14
ikip
-0.14
needle
-0.13
ģ
-0.13
dagger
-0.13
Injected
-0.13
andle
-0.13
ifetime
-0.13
POSITIVE LOGITS
civilian
0.25
civilians
0.23
collateral
0.20
Civ
0.20
Civil
0.17
targets
0.17
arget
0.16
atee
0.16
ecs
0.16
tics
0.16
Activations Density 0.066%