INDEX
Explanations
mentions of personal or public reputation or perception
references to the concept of 'image' in various contexts
New Auto-Interp
Negative Logits
hurst
-0.88
endez
-0.84
ighters
-0.83
ighter
-0.81
merce
-0.78
ugs
-0.78
woods
-0.76
uum
-0.74
cffff
-0.73
odo
-0.73
POSITIVE LOGITS
caption
0.92
image
0.92
macros
0.88
gallery
0.83
depicting
0.82
images
0.82
gallery
0.80
UAL
0.78
photographed
0.76
blurred
0.76
Activations Density 0.028%