INDEX
    Explanations

    emotional expressions of affection and warmth in interactions

    New Auto-Interp
    Negative Logits
     Plug
    -0.40
    Plug
    -0.40
     plug
    -0.39
     intéress
    -0.38
    plug
    -0.37
     PLUG
    -0.37
     reprezent
    -0.34
    Interess
    -0.34
    Crud
    -0.34
    ंदीखरीदारी
    -0.33
    POSITIVE LOGITS
     hug
    1.93
     hugs
    1.88
     hugging
    1.81
     hugged
    1.74
     kisses
    1.74
     kiss
    1.73
     embrace
    1.55
     kissed
    1.54
     kissing
    1.54
    hug
    1.53
    Act Density 0.231%

    No Known Activations