INDEX
Explanations
descriptive language related to imagery
concepts related to visual imagery and symbolism in various contexts
New Auto-Interp
Negative Logits
clud
-0.83
lad
-0.73
Tax
-0.70
galitarian
-0.64
ership
-0.64
Journal
-0.63
bern
-0.63
tein
-0.63
erm
-0.62
gres
-0.60
POSITIVE LOGITS
imagery
0.92
uggest
0.83
icz
0.81
ickr
0.78
abwe
0.75
icol
0.72
ays
0.71
destro
0.70
seys
0.70
anes
0.70
Activations Density 0.055%