INDEX
Explanations
phrases and terms related to safety and security, particularly in the context of homes and communities
Negative Logits
Safety   -0.42
safety   -0.41
Safety   -0.39
safely   -0.36
saf      -0.29
safer    -0.29
safest   -0.28
afety    -0.27
SAF      -0.27
_SAFE    -0.26
Positive Logits
sound    0.24
Sound    0.23
Sound    0.23
Secure   0.21
secure   0.20
SOUND    0.19
sound    0.19
Secure   0.19
Sec      0.17
音       0.17
Activations Density 0.024%