INDEX
Explanations
references to specific operations or missions, particularly those associated with military or police actions
New Auto-Interp
Head Attr Weights
0:0.02
1:0.10
2:0.13
3:0.05
4:0.03
5:0.02
6:0.08
7:0.15
8:0.13
9:0.10
10:0.06
11:0.07
Negative Logits
constitu
-1.23
jri
-1.22
izens
-1.21
jriwal
-1.20
ournal
-1.19
profession
-1.13
proble
-1.13
psychiat
-1.12
proposition
-1.11
unemploy
-1.10
POSITIVE LOGITS
prim
1.36
destruct
1.16
Magn
1.16
rones
1.12
bol
1.11
Proxy
1.09
Zero
1.07
Controls
1.07
Allen
1.05
keys
1.04
Activations Density 0.011%