INDEX
Explanations
words related to luxury or lavishness
New Auto-Interp
Negative Logits
ieder
-0.17
opsis
-0.16
edral
-0.16
PACKAGE
-0.15
اÙĩ
-0.15
cestor
-0.15
iams
-0.14
梨
-0.14
omes
-0.14
KERNEL
-0.14
POSITIVE LOGITS
ender
0.28
ENDER
0.24
ishly
0.24
atory
0.23
enders
0.22
arel
0.21
rov
0.20
igne
0.20
endar
0.19
atories
0.19
Activations Density 0.006%