Explanations
images with captions to "Enlarge" and "toggle"
Negative Logits
apo           -0.72
naires        -0.71
bats          -0.68
termination   -0.62
Schwar        -0.61
fronts        -0.59
Sund          -0.58
gran          -0.57
rang          -0.55
OWS           -0.55
Positive Logits
image         0.90
toggle        0.87
ARTICLE       0.85
ption         0.83
screenshot    0.76
photo         0.73
slide         0.72
thumbnail     0.72
microscope    0.71
Image         0.69
Activations Density 0.014%