INDEX
Explanations
written descriptions of glasses and their specific features
references to eyewear, specifically glasses and sunglasses
Negative Logits
ILY          -0.96
heny         -0.76
actionDate   -0.74
raltar       -0.74
DAY          -0.71
itative      -0.71
anqu         -0.66
TAIN         -0.65
BTC          -0.64
Mech         -0.63
Positive Logits
glasses      1.37
goggles      1.02
lasses       0.97
sunglasses   0.96
creen        0.88
linger       0.87
lace         0.84
tint         0.80
glass        0.79
lenses       0.79
Activations Density 0.023%