INDEX
Explanations
mentions of sweating or sweat-related activities
references to clothing, particularly sweatshirts and shorts
New Auto-Interp
Negative Logits
ational
-0.65
ential
-0.64
gor
-0.63
Pegasus
-0.62
emon
-0.61
Axis
-0.61
Chimera
-0.61
inent
-0.60
ilion
-0.60
operator
-0.59
POSITIVE LOGITS
sweats
3.24
sweat
2.15
jeans
1.91
shorts
1.81
Swe
1.76
sweater
1.54
sneakers
1.38
denim
1.29
swe
1.23
Swe
1.20
Activations Density 0.023%