INDEX
Explanations
references to safety regulations and incidents in industrial settings
safety concerns and injuries
mentions of physical harm, accidents, injuries, or workplace/operational safety hazards.
descriptions of physical hazards and injuries, especially accidents or animal attacks, and references to industrial safety/guarding measures that prevent them.
New Auto-Interp
Negative Logits
st
-0.29
voulu
-0.28
议
-0.27
得上
-0.27
ahead
-0.26
택
-0.25
ⓧ
-0.25
burgh
-0.25
skor
-0.25
siquiera
-0.24
POSITIVE LOGITS
Safety
0.60
Safety
0.59
Tikang
0.57
safety
0.56
fatal
0.56
fatalities
0.55
safety
0.54
Παραπομπές
0.54
SAFETY
0.54
zwiſchen
0.53
Activations Density 0.254%