INDEX
Explanations
references to clothing items, specifically focusing on long-sleeved shirts
the presence of the term "lee" in various contexts
New Auto-Interp
Negative Logits
Seym
-0.82
onut
-0.76
rices
-0.75
achu
-0.75
urated
-0.71
folios
-0.70
romeda
-0.70
ificial
-0.70
rait
-0.69
ifact
-0.68
POSITIVE LOGITS
pless
1.09
ce
1.00
ves
0.95
vec
0.92
lee
0.91
gha
0.90
ved
0.87
vel
0.87
scill
0.77
urring
0.74
Activations Density 0.046%