Explanations
words related to the concept of sweating
references to physical sensations and clothing, especially sweaters
Negative Logits
Pax       -0.84
Drone     -0.70
olars     -0.69
FX        -0.69
Peak      -0.68
etus      -0.66
amus      -0.65
Orbital   -0.65
Padres    -0.64
anyon     -0.63
Positive Logits
swe       3.93
sweater   1.33
tal       1.26
hatt      0.98
batt      0.97
swe       0.95
Nigel     0.93
floral    0.92
ra        0.92
fing      0.89
Activations Density: 0.037%