Explanations
image credits or captions
visual content such as images and their descriptions
Negative Logits
lies        -0.85
merce       -0.77
unin        -0.76
sole        -0.74
laws        -0.73
tml         -0.72
gger        -0.72
dies        -0.70
unts        -0.70
cair        -0.69
Positive Logits
caption      1.07
Images       1.01
Image        1.00
Image        0.99
Gallery      0.99
Thumbnails   0.93
Images       0.92
IMAGES       0.89
Comics       0.85
Courtesy     0.82
Activations Density 0.018%