INDEX
Explanations
adjectives and descriptors that signify status, quality, or prestige
New Auto-Interp
Negative Logits
ellan
-0.14
ÌĨ
-0.14
laure
-0.14
ãĥ©ãĥĥãĤ¯
-0.13
èĩ
-0.13
ÙĦÙĬÙģ
-0.13
ekim
-0.13
ENSIONS
-0.13
aven
-0.13
uang
-0.13
POSITIVE LOGITS
lest
0.15
ly
0.14
"_
0.14
uye
0.14
letal
0.14
еÑĩно
0.14
celik
0.14
Gil
0.14
LY
0.13
946
0.13
Activations Density 0.175%