INDEX
Explanations
images of shirts
references to shirts, particularly T-shirts
New Auto-Interp
Negative Logits
audi
-0.77
yrinth
-0.74
ntil
-0.67
judicial
-0.66
nesota
-0.62
elsen
-0.62
ruciating
-0.61
trib
-0.61
cffffcc
-0.60
enthal
-0.60
POSITIVE LOGITS
shirt
1.21
shirts
1.11
sleeve
1.10
shirts
1.10
leeve
1.06
sleeves
1.03
shirt
1.03
hirt
1.02
rack
0.89
pins
0.88
Activations Density 0.016%