INDEX
Explanations
content that is classified as offensive or inappropriate
New Auto-Interp
Negative Logits
رع
-0.35
⋱
-0.32
semper
-0.32
enumi
-0.32
GUILayout
-0.32
flashdata
-0.32
dogged
-0.31
sos
-0.31
withOpacity
-0.31
مهر
-0.30
POSITIVE LOGITS
offensive
1.17
nudity
1.10
objectionable
1.07
explicit
1.07
vulgar
1.05
obscene
1.04
Offensive
1.02
offensive
1.01
Explicit
1.01
Explicit
1.01
Activations Density 0.543%