INDEX
    Explanations

    images or photographs mentioned in a text

    references to images or photos being shared or posted

    New Auto-Interp
    Negative Logits
    merce
    -0.90
    schild
    -0.90
    ensable
    -0.81
    endo
    -0.74
    lear
    -0.72
    ENDED
    -0.72
    osponsors
    -0.70
    DEF
    -0.70
    EED
    -0.69
    gaard
    -0.68
    POSITIVE LOGITS
     depicting
    1.01
     caption
    0.88
     Snapchat
    0.79
     photograph
    0.76
     portraying
    0.74
     photo
    0.74
     screenshot
    0.74
     showing
    0.73
    journal
    0.72
     img
    0.72
    Act Density 0.053%

    No Known Activations