Explanations
images or photographs mentioned in the text
Negative Logits
Token        Logit
alties       -0.69
mental       -0.67
mails        -0.66
zig          -0.66
utenberg     -0.66
ally         -0.66
omsky        -0.65
few          -0.64
cession      -0.64
sense        -0.64
Positive Logits
Token        Logit
above        1.15
below        1.08
pictured     0.97
above        0.97
Above        0.94
circa        0.91
smiling      0.89
below        0.87
flanked      0.87
atop         0.85
Activations Density: 0.032%