INDEX
Explanations
references to dangerous or lethal situations
references to fatal incidents or hazardous situations
New Auto-Interp
Negative Logits
ourced
-0.88
agate
-0.83
ership
-0.82
arte
-0.81
erous
-0.81
arity
-0.81
orers
-0.78
estamp
-0.78
CLIENT
-0.77
yk
-0.76
POSITIVE LOGITS
poisonous
0.94
wounding
0.91
poison
0.90
deadly
0.89
assault
0.85
lethal
0.82
dose
0.81
toll
0.81
fighting
0.78
overdose
0.78
Activations Density 0.010%