Explanations
mentions of luxury or extravagant items or experiences
Negative Logits
ubit        -0.20
nik         -0.19
lease       -0.15
earing      -0.15
vent        -0.15
c           -0.14
Ra          -0.14
IVERS       -0.14
-American   -0.14
pent        -0.13
Positive Logits
ARRIER      0.17
uzey        0.16
Архів       0.15
chied       0.15
/lang       0.15
dre         0.15
erce        0.14
tual        0.14
Origins     0.14
oval        0.14
Activations Density: 0.004%