INDEX
    Explanations

    references to social media platforms

    mentions of the social media platform Pinterest

    New Auto-Interp
    Negative Logits
     scrimmage
    -0.66
    ~~~~~~~~
    -0.66
    ppo
    -0.65
    lves
    -0.65
    eb
    -0.64
    igh
    -0.63
    Lex
    -0.60
    ocent
    -0.60
    åĤ
    -0.59
    ×Ļ×
    -0.58
    POSITIVE LOGITS
     Pinterest
    1.34
    ileaks
    0.91
    acebook
    0.83
    atchewan
    0.81
    ulkan
    0.77
    Pinterest
    0.75
    etsy
    0.75
    psey
    0.73
    sylv
    0.71
    vine
    0.71
    Act Density 0.006%

    No Known Activations