Explanations
mentions of people or characters wearing specific items of clothing
instances of the word "wearing"
Negative Logits
edia        -0.75
Purg        -0.67
Sources     -0.64
prep        -0.64
Fund        -0.64
Democr      -0.63
Publisher   -0.63
analysis    -0.62
Torrent     -0.62
����        -0.61
Positive Logits
worn        1.23
wearer      1.13
wear        1.13
apparel     1.10
wore        1.09
clothing    1.01
wearing     1.00
jeans       0.97
uniforms    0.97
wears       0.97
Activations Density 0.012%
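
The density figure above can be read as the share of token positions on which the feature is active. This is a minimal sketch under that assumption; the page does not state how the figure was computed. The function name `activation_density`, the zero threshold, and the sample data are all hypothetical and only illustrate the arithmetic.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds threshold.

    Assumption: "density" means the share of tokens on which the feature
    fires. The source page does not give its exact definition.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: the feature fires on 3 of 25,000 token positions.
sample = [0.0] * 24997 + [1.8, 0.4, 2.6]
density = activation_density(sample)
print(f"{density:.3%}")
```

A density this low would mean the feature fires on roughly one token in every 8,000, which fits a feature tied to a narrow topic such as clothing.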