INDEX
Explanations
images with captions that have been enlarged
instances of the word "this."
Negative Logits
bats          -0.75
76561         -0.67
aturdays      -0.61
fronts        -0.61
termination   -0.60
spring        -0.57
affairs       -0.57
Tend          -0.57
naires        -0.56
lobb          -0.56
Positive Logits
image          1.00
ARTICLE        0.86
Image          0.81
toggle         0.77
image          0.76
ption          0.72
embed          0.72
malink         0.72
slide          0.70
Advertisement  0.70
Activations Density 0.008%