INDEX
Explanations
color-related terms or the word "color"
references to colors or color-related terms
New Auto-Interp
Negative Logits
doms
-1.05
idem
-0.86
_-
-0.85
olicy
-0.78
iddles
-0.73
ammad
-0.73
OTAL
-0.72
uthor
-0.72
=-=-=-=-
-0.71
Xi
-0.70
POSITIVE LOGITS
blind
1.05
palette
1.01
coded
0.86
color
0.83
color
0.82
grain
0.82
pain
0.81
Spray
0.80
="#
0.79
fully
0.79
Activations Density 0.018%