INDEX
Explanations
references to age restrictions or age-related criteria
New Auto-Interp
Negative Logits
المعيارى
-0.71
faibles
-0.63
évaluateur
-0.61
resourceCulture
-0.56
expandindo
-0.56
faible
-0.55
InputDecoration
-0.55
小さ
-0.52
moindre
-0.51
скром
-0.51
POSITIVE LOGITS
adult
0.92
Adult
0.83
adult
0.81
Adult
0.80
(>
0.76
ADULT
0.76
adults
0.75
doros
0.72
dewasa
0.70
adultes
0.69
Activations Density 0.972%