INDEX
Explanations
descriptions of violent incidents or abuse
New Auto-Interp
Negative Logits
rastructure
-0.87
omics
-0.85
sustainability
-0.84
osponsors
-0.84
erenn
-0.83
Innovation
-0.81
ahime
-0.79
odox
-0.78
ocus
-0.78
forecasting
-0.78
POSITIVE LOGITS
raping
1.25
verbally
1.16
raped
1.16
sexually
1.14
abusive
1.12
abuser
1.12
bruises
1.11
humiliating
1.10
orally
1.10
humili
1.10
Activations Density 3.911%