INDEX
Explanations
lessen costs or negative impact
New Auto-Interp
Negative Logits
因为它
-1.02
Кто
-0.97
seteq
-0.94
increased
-0.93
tembro
-0.91
sobriety
-0.91
當初
-0.91
افز
-0.90
отсутствие
-0.90
افزایش
-0.88
POSITIVE LOGITS
costs
1.60
chances
1.54
risk
1.47
reliance
1.45
dependence
1.45
amount
1.40
unnecessary
1.38
overall
1.36
number
1.33
risks
1.32
Activations Density 0.094%