INDEX
    Explanations

    phrases related to images and their captions

    references to images and their captions

    New Auto-Interp
    Negative Logits
    body
    -0.68
    DERR
    -0.65
    pole
    -0.62
    Condition
    -0.60
    Shape
    -0.59
    Advertisements
    -0.58
    ificant
    -0.58
    iqueness
    -0.58
    âĢİ
    -0.57
     prophe
    -0.57
    POSITIVE LOGITS
     toggle
    1.53
     Courtesy
    1.01
     REUTERS
    0.85
     FILE
    0.80
     screenshot
    0.74
     Image
    0.73
     Mehran
    0.71
     Chip
    0.70
     AFP
    0.70
     shutter
    0.69
    Act Density 0.016%

    No Known Activations