INDEX
Explanations
references to beauty and descriptions of aesthetic appeal
Negative Logits
tr      -0.44
tis     -0.43
pu      -0.43
pat     -0.43
tu      -0.42
onga    -0.42
ter     -0.42
grom    -0.42
gn      -0.41
pan     -0.41
Positive Logits
beautiful    1.20
beautiful    1.11
beauty       1.07
BEAUTIFUL    1.03
Beautiful    1.03
Beautiful    0.99
BEAUTY       0.97
Beauty       0.96
beauty       0.96
Beauty       0.94
Activations Density: 0.219%
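As a rough illustration of what a density figure like the one above measures, the sketch below computes the fraction of token positions on which a feature is active. This assumes the usual reading of activation density (share of tokens with activation above a threshold); the function name `activation_density`, the zero threshold, and the sample data are hypothetical and not taken from this page.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds `threshold`.

    `activations` is a flat list of per-token feature activations.
    The zero threshold is an assumption made for this sketch.
    """
    if not activations:
        raise ValueError("need at least one activation value")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: 1000 token positions, only 2 of which fire.
sample = [0.0] * 998 + [3.1, 0.7]
density = activation_density(sample)
print(f"{density:.3%}")  # prints 0.200%
```

A feature that fires on roughly 0.2% of tokens is sparse, which fits a feature tied to a narrow set of words such as the "beauty" and "beautiful" variants listed under Positive Logits.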