INDEX
Explanations
reports of deaths by various causes, particularly emphasizing suicides in different contexts and professions
New Auto-Interp
Negative Logits
elta
-0.22
sclerosis
-0.19
abil
-0.19
ilib
-0.19
fare
-0.19
robe
-0.19
mir
-0.19
hair
-0.19
ocr
-0.18
ocrates
-0.18
POSITIVE LOGITS
suicides
0.23
Tasman
0.20
Manit
0.20
igans
0.19
ipeg
0.19
ponds
0.17
istics
0.17
izations
0.17
itives
0.17
uates
0.17
Activations Density 0.524%