INDEX
Explanations
terms related to aesthetic qualities or appearances
references to aesthetic qualities and visual appeal
New Auto-Interp
Negative Logits
etime
-0.83
king
-0.74
woods
-0.69
kers
-0.69
idden
-0.68
sen
-0.67
house
-0.65
abad
-0.64
quist
-0.63
atchewan
-0.62
POSITIVE LOGITS
aesthetic
1.03
sensibilities
0.97
aesthetics
0.89
choices
0.80
preferences
0.80
hetically
0.79
flair
0.79
tastes
0.77
preference
0.77
atically
0.74
Activations Density 0.011%