INDEX
Explanations
instances or references to the color blue
New Auto-Interp
Negative Logits
tega
-0.21
rell
-0.18
resden
-0.17
acle
-0.15
nbsp
-0.15
sten
-0.15
lah
-0.15
º
-0.15
../../../
-0.15
seite
-0.15
POSITIVE LOGITS
prints
0.22
berries
0.21
berry
0.20
leaf
0.20
mont
0.17
-green
0.17
bird
0.16
ehir
0.15
/red
0.15
arrow
0.15
Activations Density 0.022%