Explanations
references to colors, particularly shades of brown
references to the color brown
Negative Logits
hw        -0.73
eva       -0.73
HF        -0.70
Wage      -0.66
DCS       -0.66
hack      -0.64
Tracks    -0.64
Dest      -0.64
Xi        -0.64
href      -0.64
Positive Logits
brown     3.48
brown     2.89
Brown     2.00
Brown     1.94
gray      1.76
yellow    1.76
grey      1.71
green     1.65
pink      1.58
black     1.56
Activations Density: 0.015%