Explanations
terms related to safety concerns or hazardous conditions
references to safety concerns or hazards
Negative Logits
thood         -0.94
ophers        -0.90
ership        -0.89
soType        -0.88
pard          -0.84
bernatorial   -0.83
ingham        -0.82
uther         -0.81
cence         -0.80
vation        -0.80
Positive Logits
unsafe         1.24
unnatural      0.94
improperly     0.81
unhealthy      0.80
unreasonable   0.78
improper       0.77
adolesc        0.73
hazardous      0.72
abnormal       0.70
wastes         0.70
Activations Density: 0.013%