INDEX
Explanations
mentions of eyeglasses
references to glasses
Negative Logits
ILY         -0.90
itative     -0.72
actionDate  -0.71
Filename    -0.70
heny        -0.68
NOR         -0.68
published   -0.67
TAIN        -0.66
ESE         -0.65
States      -0.63
Positive Logits
glasses     1.47
goggles     1.17
sunglasses  1.05
creen       1.00
linger      0.97
lasses      0.94
lace        0.90
glass       0.89
ocular      0.85
Bottle      0.83
Activations Density: 0.009%