INDEX
Explanations
actions related to carrying out attacks or operations
New Auto-Interp
Head Attr Weights
0:0.05
1:0.01
2:0.12
3:0.05
4:0.12
5:0.03
6:0.23
7:0.05
8:0.04
9:0.05
10:0.08
11:0.13
Negative Logits
hao
-1.42
workforce
-1.37
pregn
-1.32
doc
-1.30
hust
-1.30
uncond
-1.25
volunt
-1.22
rollout
-1.22
reckoning
-1.19
opio
-1.19
POSITIVE LOGITS
actionDate
1.72
eph
1.48
idan
1.45
related
1.34
sea
1.31
�
1.26
stones
1.26
uts
1.26
Nanto
1.25
�
1.24
Activations Density 0.007%