INDEX
Explanations
references to criminal incidents involving men
references to male individuals, particularly mentioning their age
New Auto-Interp
Negative Logits
çļ
-0.74
Grid
-0.69
ãĤ¦
-0.68
Machines
-0.68
chars
-0.67
ffee
-0.65
Journals
-0.64
instr
-0.64
ellect
-0.63
aeda
-0.62
POSITIVE LOGITS
hunt
1.00
endez
0.88
who
0.83
nered
0.76
accused
0.74
gunman
0.74
WHO
0.74
SWAT
0.72
ishly
0.71
ish
0.70
Activations Density 0.128%