    Explanations

    References to "pin" in various contexts, particularly those related to the social media platform Pinterest.

    Negative Logits
    Token         Logit
    "edor"        -0.19
    "ummies"      -0.15
    "řil"         -0.15
    "stown"       -0.15
    "/display"    -0.15
    "inges"       -0.15
    "eşit"        -0.15
    "ätz"         -0.14
    "ë·"          -0.14
    "owitz"       -0.14
    Positive Logits
    Token         Logit
    "ning"        0.35
    " Pin"        0.20
    "ners"        0.19
    "ny"          0.19
    "heiro"       0.19
    "NING"        0.19
    "Pin"         0.19
    "lerce"       0.18
    ".Pin"        0.18
    "ching"       0.17
    Activation Density: 0.018%

    No Known Activations