INDEX
Explanations
inexpensive and beginner items
New Auto-Interp
Negative Logits
elegant
0.47
elegantly
0.46
Luxurious
0.46
优雅
0.46
nuanced
0.45
elegan
0.42
elegante
0.42
erud
0.41
峯
0.41
luxurious
0.41
POSITIVE LOGITS
cheap
1.60
cheap
1.41
cheapest
1.37
inexpensive
1.37
Cheap
1.34
cheaply
1.31
Cheap
1.30
सस्ते
1.30
деше
1.29
cheaper
1.23
Activations Density 0.107%