INDEX
Explanations
references to significant violent events and their implications
New Auto-Interp
Negative Logits
igators
-0.15
leh
-0.15
Whip
-0.15
593
-0.14
Palace
-0.14
Petro
-0.13
arsed
-0.13
اصÙĦاØŃ
-0.13
Orleans
-0.13
ugar
-0.13
POSITIVE LOGITS
Pakistan
0.28
Pakistani
0.26
Pakistan
0.26
CIA
0.24
ISI
0.24
drone
0.22
Bin
0.22
bin
0.22
Predator
0.22
AQ
0.22
Activations Density 0.024%