INDEX
Explanations
words denoting a person who has been harmed or injured, particularly in the context of a crime or wrongdoing
words related to victimization or legal proceedings
references to victims in various contexts
New Auto-Interp
Negative Logits
andel
-0.84
aeda
-0.74
ooks
-0.71
rosso
-0.70
hire
-0.69
leaf
-0.68
ort
-0.66
obar
-0.64
lav
-0.64
upp
-0.63
POSITIVE LOGITS
Victims
0.96
victim
0.93
izers
0.92
victims
0.92
Victim
0.91
izes
0.91
vict
0.86
Vict
0.85
hood
0.84
izer
0.82
Activations Density 0.017%