INDEX
Explanations
references to fear and safety concerning people's lives
New Auto-Interp
Head Attr Weights
0:0.02
1:0.01
2:0.13
3:0.08
4:0.22
5:0.03
6:0.08
7:0.19
8:0.04
9:0.03
10:0.06
11:0.05
Negative Logits
Module
-1.59
�
-1.43
elected
-1.39
soDeliveryDate
-1.38
icion
-1.36
Myth
-1.32
Recommended
-1.32
endum
-1.28
Guest
-1.27
aucus
-1.25
POSITIVE LOGITS
fractures
1.49
loo
1.34
aeda
1.30
lihood
1.29
relationship
1.28
helpless
1.27
shed
1.27
Cause
1.26
plight
1.25
loss
1.25
Activations Density 0.002%