INDEX
Explanations
references to wearing or not wearing specific items of clothing
instances of the word "wear" and its variations
New Auto-Interp
Negative Logits
aminer
-0.75
=-=-=-=-
-0.74
Debor
-0.71
demon
-0.71
estine
-0.70
raction
-0.69
akespeare
-0.68
heid
-0.67
yrinth
-0.65
ommod
-0.65
POSITIVE LOGITS
apparel
1.13
clothing
1.02
jeans
1.01
worn
1.00
gloves
0.97
underwear
0.96
diapers
0.96
clothes
0.95
shoes
0.93
shirts
0.91
Activations Density 0.041%