INDEX
Explanations
references to specific images or captions in a news context
references to specific images or visual content in documents
New Auto-Interp
Negative Logits
pill
-0.79
ocr
-0.68
hop
-0.68
boro
-0.67
Norn
-0.67
SG
-0.65
Sword
-0.65
blank
-0.64
··
-0.63
lethal
-0.62
POSITIVE LOGITS
Photos
0.88
window
0.87
Tanks
0.73
Caption
0.69
WATCHED
0.69
kay
0.65
FILE
0.63
msec
0.61
captures
0.60
ammon
0.59
Activations Density 0.063%