Explanations
references to clothing articles, specifically pants
references to various types of pants and related clothing items
Negative Logits
Token      Logit
Scotia     -0.72
KNOWN      -0.70
ende       -0.70
AUT        -0.69
rian       -0.68
Galile     -0.68
Reviewed   -0.67
Kern       -0.66
rology     -0.65
istic      -0.65
Positive Logits
Token      Logit
pants      1.08
trousers   1.06
hirt       1.05
leeve      1.02
pants      0.97
bag        0.96
jeans      0.95
belt       0.93
sleeves    0.92
uits       0.92
Activations Density: 0.028%