INDEX
Explanations
words related to beauty and aesthetic appreciation
New Auto-Interp
Negative Logits
beauty
-0.19
Beauty
-0.19
beaut
-0.18
Beauty
-0.18
Beautiful
-0.17
ativ
-0.17
Beaut
-0.16
isure
-0.15
beautiful
-0.15
Beautiful
-0.15
POSITIVE LOGITS
lest
0.34
-looking
0.20
ness
0.19
thing
0.19
irony
0.17
mente
0.17
zza
0.17
smelling
0.17
little
0.16
weather
0.16
Activations Density 0.080%