INDEX
Explanations
highly attractive attributes or characteristics
repeated mentions of the word "attractive" in various contexts
New Auto-Interp
Negative Logits
cedented
-0.87
cham
-0.87
othe
-0.83
ignt
-0.79
bel
-0.77
othing
-0.76
iche
-0.76
feeding
-0.75
Airl
-0.75
apolis
-0.71
POSITIVE LOGITS
lure
0.96
proposition
0.92
attractive
0.88
prospects
0.85
prospect
0.84
targets
0.79
tempt
0.77
attractiveness
0.76
attraction
0.73
attract
0.73
Activations Density 0.062%