INDEX
Explanations
references to domestic violence
New Auto-Interp
Negative Logits
*/(
-0.95
uyomi
-0.94
GOODMAN
-0.91
umper
-0.81
UMP
-0.80
SOURCE
-0.78
uden
-0.78
Reviewer
-0.75
DragonMagazine
-0.74
akeru
-0.72
POSITIVE LOGITS
Violence
1.03
violence
1.01
affairs
0.97
tranqu
0.90
abusers
0.87
abuser
0.82
abuse
0.82
estic
0.82
violence
0.81
appliances
0.81
Activations Density 0.008%