INDEX
Explanations
colors and related descriptors for objects
New Auto-Interp
Negative Logits
ivol
-0.17
redhead
-0.16
beige
-0.16
golden
-0.15
rede
-0.15
pany
-0.15
èħ°
-0.15
ivory
-0.15
æ¦
-0.15
Orange
-0.15
POSITIVE LOGITS
blue
0.84
Blue
0.82
Blue
0.77
blue
0.73
BLUE
0.73
-blue
0.71
BLUE
0.65
èĵĿ
0.60
_blue
0.59
.blue
0.58
Activations Density 0.053%