INDEX
Explanations
references to bags and purses
New Auto-Interp
Negative Logits
íĤ¹
-0.17
shirt
-0.17
ije
-0.16
uniforms
-0.16
rophe
-0.15
shirts
-0.15
shirt
-0.15
gency
-0.14
roph
-0.14
Uniform
-0.13
POSITIVE LOGITS
bag
0.36
bags
0.36
purse
0.33
purs
0.33
Bag
0.31
bag
0.31
Bags
0.31
bags
0.30
_bag
0.30
Bag
0.29
Activations Density 0.027%