INDEX
Explanations
concepts related to safety and risks, particularly concerning children and hazardous situations
New Auto-Interp
Negative Logits
_firestore
-0.18
ema
-0.16
/INFO
-0.15
lems
-0.14
<Service
-0.14
oin
-0.14
ands
-0.14
oken
-0.14
.tell
-0.14
ãģĵãĤĵ
-0.14
POSITIVE LOGITS
dangerous
0.22
dangers
0.19
-safe
0.18
safety
0.18
danger
0.17
-danger
0.17
danger
0.16
afe
0.16
freel
0.16
Dangerous
0.16
Activations Density 0.123%