INDEX
Explanations
terms related to legal proceedings and sexual offenses
New Auto-Interp
Negative Logits
DoubleQuotes
-0.80
новниш
-0.78
onAnimation
-0.76
tagHelperRunner
-0.74
MENAFN
-0.71
ब्रेकडाउन
-0.69
שוליים
-0.66
ⓧ
-0.66
__':
-0.63
featureID
-0.63
POSITIVE LOGITS
sexual
1.46
Sexual
1.24
sexually
1.23
Sexual
1.20
sex
1.10
sexuelle
1.08
seksual
1.07
rape
1.04
sexual
1.03
SEX
0.97
Activations Density 0.199%