INDEX
Explanations
words related to attractiveness or desirability
concepts related to attractiveness or interest to people, particularly in the context of products, ideas, or individuals
New Auto-Interp
Negative Logits
Ñĥ
-0.78
Colleges
-0.74
ifa
-0.72
Rost
-0.67
metal
-0.66
Berk
-0.66
kson
-0.65
Brut
-0.63
Coh
-0.62
fters
-0.62
POSITIVE LOGITS
Flavoring
1.12
ingly
0.97
yrinth
0.94
ocene
0.92
ously
0.88
minist
0.84
atism
0.84
ĸļ
0.81
ikawa
0.74
eals
0.74
Activations Density 0.017%