INDEX
Explanations
adjectives or phrases related to visual appearance and design
New Auto-Interp
Negative Logits
urdue
-0.82
uesday
-0.69
hement
-0.68
illet
-0.66
Together
-0.65
ycle
-0.63
igham
-0.62
jri
-0.60
ital
-0.59
improv
-0.59
POSITIVE LOGITS
beh
0.95
ears
0.92
ĸļ
0.86
hairs
0.85
sockets
0.82
itness
0.82
catching
0.79
watering
0.79
eye
0.78
noses
0.75
Activations Density 0.072%