INDEX
Explanations
references to victims in various contexts such as abuse, accidents, or legal cases
references to victims in various contexts
New Auto-Interp
Negative Logits
CLASSIFIED
-0.70
OPLE
-0.68
GGGG
-0.66
ove
-0.66
REE
-0.65
Huck
-0.63
alter
-0.63
leaf
-0.62
yip
-0.62
Style
-0.61
POSITIVE LOGITS
victims
1.00
Victims
0.92
vict
0.90
victim
0.83
izers
0.82
inflicted
0.80
Victim
0.80
Survivors
0.77
hip
0.74
succumbed
0.74
Activations Density 0.021%