INDEX
Explanations
instances of people wearing T-shirts with specific words or phrases
references to T-shirts and their variations
New Auto-Interp
Negative Logits
distant
-0.69
presided
-0.66
Reviewer
-0.66
«ĺ
-0.64
20439
-0.63
judicial
-0.63
ruciating
-0.62
ntil
-0.60
audi
-0.60
Trem
-0.59
POSITIVE LOGITS
shirt
1.41
shirts
1.24
hirt
1.02
shirts
1.01
idas
0.92
shirt
0.91
boy
0.86
Shirt
0.86
leeve
0.85
cloth
0.84
Activations Density 0.006%