    Explanations

    words related to images or visual content

    references to images or pictures

    Negative Logits
    "iant"          -0.76
    "vernment"      -0.75
    "rador"         -0.72
    "edient"        -0.72
    "aughs"         -0.72
    "iance"         -0.70
    "bard"          -0.69
    "ible"          -0.69
    "orne"          -0.69
    "hement"        -0.68

    Positive Logits
    "picture"        0.87
    " depicting"     0.84
    " galleries"     0.84
    "Pic"            0.83
    " Pic"           0.82
    "Pict"           0.81
    "que"            0.80
    " img"           0.78
    " mosa"          0.78
    ">>>>>>>>"       0.78

    Act Density 0.023%

    No Known Activations