INDEX
Explanations
phrases related to causing harm, damage, and negative consequences
phrases related to harm and damage
Negative Logits
Token               Logit
*/(                 -0.73
IQ                  -0.72
quire               -0.71
natureconservancy   -0.69
lio                 -0.68
dayName             -0.67
oS                  -0.67
FN                  -0.66
soDeliveryDate      -0.66
ophy                -0.65
Positive Logits
Token               Logit
inflicted           1.55
casualties          1.08
wounds              1.07
suffered            1.07
griev               1.07
incurred            1.06
injuries            1.03
loss                1.01
wounding            0.99
damages             0.99
Activations Density: 0.675%