    Explanations

    references to specific images or captions in a news context

    references to specific images or visual content in documents

    Negative Logits
    Token       Logit
    "pill"      -0.79
    "ocr"       -0.68
    "hop"       -0.68
    "boro"      -0.67
    " Norn"     -0.67
    "SG"        -0.65
    "Sword"     -0.65
    "blank"     -0.64
    "··"        -0.63
    "lethal"    -0.62
    Positive Logits
    Token         Logit
    " Photos"     0.88
    "window"      0.87
    " Tanks"      0.73
    " Caption"    0.69
    " WATCHED"    0.69
    " kay"        0.65
    " FILE"       0.63
    " msec"       0.61
    " captures"   0.60
    " ammon"      0.59
    Activation Density: 0.063%

    No Known Activations