INDEX
Explanations
references to nudity or being naked
occurrences of the term "naked."
New Auto-Interp
Negative Logits
riers
-0.86
dule
-0.79
rier
-0.79
vernment
-0.78
enges
-0.78
Flavoring
-0.74
ENCY
-0.74
ãĥ£
-0.73
CE
-0.73
ound
-0.72
POSITIVE LOGITS
mole
0.98
naked
0.88
selfies
0.84
nude
0.80
silhou
0.77
selfie
0.76
Naked
0.74
ity
0.74
sun
0.74
portraits
0.74
Activations Density 0.033%