Explanations
references to the word "Gray" and its variations in the context of hound breeds or colors
Negative Logits
ufen      -0.16
aclass    -0.16
ogan      -0.15
iyat      -0.15
erged     -0.15
oret      -0.14
rieb      -0.14
uib       -0.14
arrass    -0.14
inez      -0.14
Positive Logits
hound       0.29
Matter      0.20
matter      0.20
-haired     0.19
son         0.19
-scale      0.19
SCALE       0.18
skies       0.18
anatomy     0.18
literature  0.17
Activations Density 0.010%