INDEX
Explanations
items related to decorative elements
New Auto-Interp
Negative Logits
ercul
-0.14
illez
-0.14
ieves
-0.13
kles
-0.13
vap
-0.13
گاÙĨ
-0.13
orney
-0.12
ilder
-0.12
pok
-0.12
åĨ
-0.12
POSITIVE LOGITS
blue
0.58
yellow
0.57
white
0.52
orange
0.52
green
0.51
red
0.50
brown
0.48
gray
0.47
purple
0.47
grey
0.46
Activations Density 0.994%