INDEX
Explanations
mentions of tragic events and condolences
mentions of death and expressions of grief
New Auto-Interp
Negative Logits
direction
-0.76
ivalry
-0.73
Style
-0.72
mode
-0.71
irection
-0.68
iquette
-0.68
mosp
-0.67
wcsstore
-0.66
familiarity
-0.65
explanations
-0.64
POSITIVE LOGITS
murdered
0.99
slain
0.96
deceased
0.94
killed
0.94
drowned
0.91
abducted
0.91
martyr
0.90
gunned
0.88
detainee
0.88
kidnapped
0.86
Activations Density 0.352%