INDEX
Explanations
references to damage and its effects
Negative Logits
abestanden          -0.64
setVerticalGroup    -0.59
stateProvider       -0.53
ições               -0.51
AndEndTag           -0.50
Према               -0.50
banger              -0.48
menudo              -0.48
fxml                -0.47
atorship            -0.47
Positive Logits
inflicted           1.11
done                1.00
sustained           0.95
suffered            0.85
done                0.81
DONE                0.77
caused              0.76
dealt               0.76
Done                0.74
incurred            0.72
Activations Density 0.228%