INDEX
Explanations
text pieces indicating image captions with a focus on medical topics
image captions
New Auto-Interp
Negative Logits
onest
-0.85
reborn
-0.72
aturdays
-0.70
empt
-0.69
fleet
-0.66
etheless
-0.66
eday
-0.66
glas
-0.65
quart
-0.65
avorite
-0.65
POSITIVE LOGITS
UTERS
0.87
Provided
0.83
Immun
0.77
=>
0.74
Surveillance
0.73
Image
0.72
Streamer
0.72
IMAGES
0.71
Protesters
0.70
Researchers
0.70
Activations Density 0.065%