INDEX
Explanations
words and phrases related to clothing items, specifically T-shirts
references to T-shirts
Negative Logits
Token      Logit
antid      -0.70
minimized  -0.65
Helpful    -0.64
prof       -0.63
ãģ®éŃĶ     -0.60
rency      -0.60
âĢ¢âĢ¢     -0.58
wiret      -0.58
EGIN       -0.57
theless    -0.57
Positive Logits
Token      Logit
shirt      1.18
shirts     1.15
rex        1.01
Rex        0.87
iron       0.86
minus      0.85
edo        0.84
mobile     0.84
spir       0.81
Mobile     0.80
Activations Density: 0.037%