INDEX
    Explanations

    words related to potential dangers or risks

    references to various types of hazards

    New Auto-Interp
    Negative Logits
    gdala
    -1.03
    ebus
    -0.99
    artney
    -0.87
    olitan
    -0.83
    ergy
    -0.82
    ainers
    -0.82
    itton
    -0.79
    zos
    -0.79
    este
    -0.77
    atters
    -0.76
    POSITIVE LOGITS
     hazards
    1.13
    ously
    1.04
     hazard
    1.04
     mitigation
    0.86
     endanger
    0.86
     deterrent
    0.77
     lurking
    0.77
     lur
    0.73
     danger
    0.71
     peril
    0.71
    Act Density 0.017%

    No Known Activations