INDEX
Explanations
words related to legal and criminal matters, specifically focusing on terms related to criminal offenders and sentencing
references to individuals who have committed crimes or offenses
New Auto-Interp
Negative Logits
ma
-0.69
Mom
-0.66
ype
-0.61
jet
-0.61
ira
-0.60
tz
-0.58
ruce
-0.57
fig
-0.57
Walt
-0.56
Sur
-0.56
POSITIVE LOGITS
offenders
3.72
offender
3.55
abusers
1.84
perpetrators
1.83
offending
1.76
culprit
1.70
offenses
1.64
criminals
1.63
offences
1.63
perpetrator
1.54
Activations Density 0.013%