INDEX
Explanations
mentions of clothing or items worn by individuals
instances of the word "wearing."
NEGATIVE LOGITS
Token        Logit
edia         -0.85
uddin        -0.70
deal         -0.68
MQ           -0.68
=-=-=-=-     -0.68
ISO          -0.64
estine       -0.64
Publisher    -0.64
DL           -0.64
article      -0.63
POSITIVE LOGITS
Token        Logit
worn         1.14
wearer       1.02
apparel      1.02
clothing     0.98
jeans        0.94
wear         0.92
shoes        0.90
robes        0.88
uniforms     0.87
clothes      0.86
Activations Density 0.014%
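
For readers unfamiliar with the density figure, the sketch below shows one common way such a number could be computed. It assumes that "density" means the fraction of token positions at which the feature's activation is above zero; the page itself does not define it. The function name `activation_density`, the `threshold` parameter, and the sample data are hypothetical and chosen only for illustration.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds threshold.

    Assumption: density = (number of active positions) / (total positions).
    """
    total = len(activations)
    if total == 0:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / total


# Hypothetical data: 100,000 token positions, 14 of which activate the feature.
acts = [1.5] * 14 + [0.0] * (100_000 - 14)
density = activation_density(acts)
print(f"{density:.3%}")
```

Under that assumption, a density of 0.014% corresponds to the feature firing on roughly 1.4 out of every 10,000 tokens, which fits a narrow feature tied to clothing-related words.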