Explanations
words related to visual perception and observation
references to vision or visual perception
Negative Logits
ulative      -0.72
currency     -0.67
afort        -0.66
accompan     -0.65
emetery      -0.64
astically    -0.62
aceae        -0.60
orously      -0.59
itates       -0.58
idity        -0.58
Positive Logits
seeing       1.90
lights       1.11
ings         1.10
mares        1.03
lines        0.98
unseen       0.94
ening        0.92
glass        0.91
line         0.89
nings        0.87
Activations Density: 0.045%