INDEX
Explanations
words or phrases related to attractiveness or desirability
descriptors associated with attractiveness
Negative Logits
cedented    -0.88
othe        -0.76
avis        -0.74
iche        -0.73
jer         -0.71
bel         -0.68
cham        -0.68
FIN         -0.66
ibel        -0.65
ignt        -0.65
Positive Logits
lure            0.89
attractive      0.86
proposition     0.85
lihood          0.80
Magikarp        0.75
attractiveness  0.75
enticing        0.73
propositions    0.71
contests        0.70
attracts        0.69
Activations Density 0.022%
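
For scale, a density of 0.022% corresponds to roughly 22 active tokens per 100,000. The sketch below shows one way such a figure could be computed, assuming density means the fraction of sampled tokens on which the feature's activation exceeds a threshold. That definition, the function name `activation_density`, the threshold of 0.0, and the sample data are all illustrative assumptions, not taken from the page itself.

```python
def activation_density(activations, threshold=0.0):
    """Fraction of tokens whose activation is above `threshold`.

    Assumed definition for illustration; the dashboard may compute
    its density figure differently.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 100,000 tokens, 22 of which activate the feature.
sample = [0.0] * 99_978 + [1.5] * 22
density = activation_density(sample)
print(f"{density:.3%}")  # prints 0.022%
```

Under this assumed definition, a feature this sparse fires on only a small handful of tokens in any ordinary passage of text.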