Explanations
words related to clothing or attire
references to clothing or attire
Negative Logits
birth      -0.71
Quincy     -0.69
depress    -0.67
ollow      -0.65
elaide     -0.64
gluc       -0.63
Liqu       -0.62
ollo       -0.61
asper      -0.61
Drama      -0.58
Positive Logits
ters       0.98
outfits    0.96
outfit     0.95
apparel    0.93
swick      0.91
puter      0.86
mentation  0.84
ments      0.84
glers      0.79
ema        0.79
Activations Density: 0.012%