INDEX
    Explanations

    phrases related to safety and caution

    concepts related to safety and safe environments

    New Auto-Interp
    Negative Logits
    lez
    -0.83
    xual
    -0.72
    erent
    -0.69
    Lenin
    -0.66
     intensify
    -0.65
    yss
    -0.65
    */(
    -0.65
    çīĪ
    -0.63
    rots
    -0.62
    AX
    -0.61
    POSITIVE LOGITS
    ounters
    0.70
     safe
    0.69
    iland
    0.67
    safe
    0.65
    dden
    0.62
    Safety
    0.60
     Safe
    0.60
     childbirth
    0.59
    glers
    0.59
    aband
    0.58
    Act Density 0.264%

    No Known Activations