INDEX
Explanations
words related to visual or sensory evaluation such as "looks", "tastes", "feels"
descriptions of visual appearances
New Auto-Interp
Negative Logits
limit
-0.73
ulla
-0.69
learning
-0.67
ilings
-0.67
upuncture
-0.67
mental
-0.66
trl
-0.66
Osw
-0.65
venient
-0.64
ference
-0.63
POSITIVE LOGITS
suspic
0.93
like
0.89
awfully
0.88
identical
0.84
strikingly
0.83
blurry
0.81
sleek
0.80
vaguely
0.79
prett
0.79
shiny
0.78
Activations Density 0.068%