INDEX
    Explanations

    content related to news articles or headlines

    commands related to enlarging or toggling images

    New Auto-Interp
    Negative Logits
    warm
    -0.65
    naires
    -0.65
    angering
    -0.62
    angers
    -0.62
    nuts
    -0.62
    anger
    -0.62
    onew
    -0.61
     upt
    -0.60
    comes
    -0.60
     sucker
    -0.60
    POSITIVE LOGITS
    Enlarge
    0.94
    ONSORED
    0.92
    UTERS
    0.85
     Image
    0.85
     WATCHED
    0.85
     caption
    0.79
     toggle
    0.73
     PHOTO
    0.73
     Photo
    0.71
     Reached
    0.69
    Act Density 0.011%

    No Known Activations