INDEX
Explanations
references to bags and purses
handbag and purse
New Auto-Interp
Negative Logits
Shirt
-0.50
Houſe
-0.49
ſelf
-0.49
enfans
-0.48
Diſ
-0.48
Jefus
-0.46
shirt
-0.46
MigrationBuilder
-0.45
ſch
-0.45
$_(
-0.45
POSITIVE LOGITS
handbag
0.84
handbags
0.81
purses
0.73
purse
0.72
Purse
0.61
crossbody
0.57
👜
0.55
toreb
0.54
bag
0.54
bags
0.52
Activations Density 0.008%