Explanations
images with captions
references to specific images or visual elements
Negative Logits
bats      -0.66
apo       -0.65
APD       -0.65
uality    -0.60
naires    -0.59
Sund      -0.59
poke      -0.59
Jet       -0.58
Schwar    -0.58
uls       -0.57
Positive Logits
toggle      0.98
image       0.88
thumbnail   0.82
screenshot  0.81
WATCHED     0.79
photo       0.79
microscope  0.78
ARTICLE     0.78
Image       0.77
transcript  0.76
Activations Density 0.013%