INDEX
    Explanations

    items received as gifts and expressions of enthusiasm towards them

    New Auto-Interp
    Negative Logits
     milf
    -1.59
     hairc
    -1.49
     increa
    -1.47
     thut
    -1.40
     fta
    -1.37
     ?...
    -1.36
     ugg
    -1.36
     strick
    -1.36
     !...
    -1.35
     wherea
    -1.34
    POSITIVE LOGITS
     gift
    0.94
     gifts
    0.92
    🎁
    0.80
    <bos>
    0.75
     Gift
    0.74
    gift
    0.74
     Gifts
    0.69
     Christmas
    0.65
    Gift
    0.65
    gif
    0.65
    Act Density 0.419%

    No Known Activations