INDEX
Explanations
references to image copyrights
references to images in the document
New Auto-Interp
Negative Logits
prep
-0.71
roommate
-0.69
certified
-0.68
ass
-0.68
ages
-0.67
ear
-0.67
duty
-0.65
oped
-0.64
du
-0.63
viol
-0.63
POSITIVE LOGITS
Image
4.09
Images
2.26
Image
2.21
image
2.06
Photo
1.93
Media
1.87
Picture
1.57
image
1.47
Images
1.46
images
1.38
Activations Density 0.019%