INDEX
    Explanations

    images with captions that have been enlarged

    instances of the word "this."

    Negative Logits
    Token             Logit
    "bats"            -0.75
    "76561"           -0.67
    "aturdays"        -0.61
    " fronts"         -0.61
    "termination"     -0.60
    "spring"          -0.57
    " affairs"        -0.57
    " Tend"           -0.57
    "naires"          -0.56
    " lobb"           -0.56
    Positive Logits
    Token             Logit
    " image"          1.00
    " ARTICLE"        0.86
    " Image"          0.81
    " toggle"         0.77
    "image"           0.76
    "ption"           0.72
    "embed"           0.72
    "malink"          0.72
    " slide"          0.70
    " Advertisement"  0.70
    Activation Density: 0.008%

    No Known Activations