INDEX
Explanations
phrases indicating value or worth
New Auto-Interp
Negative Logits
Diwedd
-0.60
LabelTagHelper
-0.60
UpInside
-0.59
Décès
-0.56
Хьажоргаш
-0.56
المعيارى
-0.53
eneral
-0.53
Drapeau
-0.52
BindView
-0.52
chelsea
-0.52
POSITIVE LOGITS
worth
4.21
Worth
3.22
WORTH
3.14
Worth
3.13
worth
2.84
WORTH
2.40
worthwhile
2.15
worthy
2.07
值得
1.65
Worthy
1.64
Activations Density 0.104%