INDEX
    Explanations

    concepts related to safety and risks, particularly concerning children and hazardous situations

    Negative Logits
    Token           Logit
    "_firestore"    -0.18
    "ema"           -0.16
    "/INFO"         -0.15
    "lems"          -0.14
    "<Service"      -0.14
    "oin"           -0.14
    "ands"          -0.14
    "oken"          -0.14
    ".tell"         -0.14
    "ãģĵãĤĵ"        -0.14
    Positive Logits
    Token           Logit
    " dangerous"    0.22
    " dangers"      0.19
    "-safe"         0.18
    " safety"       0.18
    " danger"       0.17
    "-danger"       0.17
    "danger"        0.16
    "afe"           0.16
    " freel"        0.16
    " Dangerous"    0.16
    Act Density 0.123%

    No Known Activations