INDEX
    Explanations

    expressions of enthusiasm and appreciation for food-related experiences

    New Auto-Interp
    Negative Logits
    ian
    -0.14
    segue
    -0.14
    ahu
    -0.14
    rippling
    -0.14
    -alist
    -0.14
     acquitted
    -0.14
    oted
    -0.13
     Toys
    -0.13
    olid
    -0.13
    oyal
    -0.13
    POSITIVE LOGITS
    Bookmark
    0.21
     pinned
    0.20
     bookmark
    0.20
     dro
    0.18
     Mouth
    0.18
     PIN
    0.18
     Bookmark
    0.17
     Pin
    0.17
     mouths
    0.17
    mouth
    0.17
    Act Density 0.016%

    No Known Activations