INDEX
    Explanations

    positive emotions and expressions of happiness

    happiness or positive emotions

    New Auto-Interp
    Negative Logits
     finesse
    -0.45
    ご注意
    -0.39
     save
    -0.39
    importanza
    -0.38
     nonUne
    -0.38
     fascinated
    -0.38
    -0.36
    hilangan
    -0.36
     subtlety
    -0.36
    itoare
    -0.35
    POSITIVE LOGITS
     positivity
    0.80
     upbeat
    0.77
     smile
    0.77
    Smile
    0.77
     cheerful
    0.77
     optimism
    0.76
     Smile
    0.73
    optimis
    0.73
     smiling
    0.72
     smiles
    0.72
    Act Density 0.362%

    No Known Activations