INDEX
Explanations
terms associated with sexual violence and abuse
New Auto-Interp
Negative Logits
Lázaro
-0.59
surla
-0.58
ichten
-0.57
baiki
-0.56
متحده
-0.54
optimis
-0.54
atikan
-0.53
facie
-0.53
GLint
-0.52
ढ
-0.52
POSITIVE LOGITS
abuse
1.25
abused
1.12
Abuse
1.09
abuse
1.07
rape
1.07
Abuse
1.07
abus
1.04
raped
0.98
abusive
0.97
abuses
0.97
Activations Density 0.414%