INDEX
Explanations
descriptions or mentions of beauty
references to the concept of beauty
New Auto-Interp
Negative Logits
arbon
-0.83
hov
-0.75
renheit
-0.73
kson
-0.72
eded
-0.72
sol
-0.66
eding
-0.66
orum
-0.66
imov
-0.64
ÑĢ
-0.64
POSITIVE LOGITS
pageant
1.04
beauty
0.83
queen
0.77
contests
0.77
contestant
0.76
Nicole
0.71
queens
0.70
salon
0.67
secrets
0.64
issance
0.64
Activations Density 0.009%