INDEX
Explanations
phrases related to risks, hazards, and potential harm
terms related to health risks and safety concerns
New Auto-Interp
Negative Logits
swick
-0.76
Blocks
-0.71
lio
-0.68
vez
-0.67
ggles
-0.65
gdala
-0.63
atters
-0.62
vil
-0.62
ership
-0.62
FTWARE
-0.62
POSITIVE LOGITS
incurred
1.20
arising
1.15
associated
1.12
consequences
1.08
attributable
1.06
inherent
1.04
outweigh
1.03
outwe
1.00
resulting
0.99
stemming
0.98
Activations Density 0.271%