INDEX
    Explanations

    phrases and terms related to safety and security, particularly in the context of homes and communities

    New Auto-Interp
    Negative Logits
     Safety
    -0.42
     safety
    -0.41
    Safety
    -0.39
     safely
    -0.36
     saf
    -0.29
     safer
    -0.29
     safest
    -0.28
    afety
    -0.27
     SAF
    -0.27
    _SAFE
    -0.26
    POSITIVE LOGITS
     sound
    0.24
     Sound
    0.23
    Sound
    0.23
     Secure
    0.21
     secure
    0.20
     SOUND
    0.19
    sound
    0.19
    Secure
    0.19
    Sec
    0.17
    éŁ³
    0.17
    Act Density 0.024%

    No Known Activations