INDEX
Explanations
concepts related to health improvement and wellness
after "stay"
describing positive states of being
New Auto-Interp
Negative Logits
awkwardly
-0.52
الحياه
-0.51
Worse
-0.50
<bos>
-0.48
underwhelming
-0.48
Etimología
-0.47
やや
-0.47
どうしても
-0.47
degrad
-0.47
disappointing
-0.46
POSITIVE LOGITS
healthy
1.22
healthier
1.17
happy
1.10
happier
1.05
Healthy
1.03
healthiest
1.02
healthy
1.02
Healthy
1.00
HAPPY
0.93
heureuse
0.93
Activations Density 0.208%