INDEX
Explanations
references to women's lingerie and related garments
New Auto-Interp
Negative Logits
vrier
-0.15
ãģıãĤĵ
-0.15
imbus
-0.15
collateral
-0.15
boots
-0.14
kart
-0.14
urf
-0.14
xit
-0.14
mbH
-0.14
annot
-0.14
POSITIVE LOGITS
bra
0.37
bras
0.35
Bra
0.32
Bras
0.30
bra
0.28
cups
0.27
cup
0.27
Cups
0.26
bande
0.24
bras
0.24
Activations Density 0.041%