INDEX
Explanations
mentions of people's deaths, including locations and ages
references to death and associated details such as time and place
New Auto-Interp
Negative Logits
ï¸ı
-0.81
ãĥ¼ãĤ¯
-0.76
soDeliveryDate
-0.72
selves
-0.71
notation
-0.70
jab
-0.68
XY
-0.68
bies
-0.68
Girls
-0.67
gey
-0.67
POSITIVE LOGITS
childbirth
1.21
hosp
1.00
hospital
0.98
infancy
0.93
cardiac
0.86
crem
0.85
prison
0.82
renal
0.81
liver
0.81
tragically
0.79
Activations Density 0.097%