Explanations
terms related to domestic and sexual violence, including its victims and the context surrounding abusive relationships
Negative Logits
dafx              -0.56
jestic            -0.54
Stakes            -0.52
Cola              -0.51
iculous           -0.50
limat             -0.50
Encyklopedia      -0.50
verk              -0.50
minde             -0.49
ixon              -0.49
Positive Logits
abusive           0.64
violence          0.64
abuse             0.63
domestic          0.63
bruises           0.61
violence          0.59
MockMvc           0.58
beat              0.58
ThroughAttribute  0.57
Violence          0.56
Activations Density: 0.323%