INDEX
Explanations
mentions of people or entities wearing specific items or accessories
instances of the word "wearing."
New Auto-Interp
Negative Logits
cffffcc
-0.81
=-=-=-=-
-0.73
COUR
-0.69
later
-0.68
MQ
-0.66
demon
-0.66
deal
-0.65
izoph
-0.65
estine
-0.65
edia
-0.65
POSITIVE LOGITS
apparel
0.94
jeans
0.89
worn
0.87
clothing
0.84
shoes
0.83
clothes
0.80
ables
0.79
robes
0.79
underwear
0.79
ves
0.78
Activations Density 0.017%