    Explanations

    colors and related descriptors for objects

    Negative Logits
    Token          Logit
    "ivol"         -0.17
    " redhead"     -0.16
    " beige"       -0.16
    " golden"      -0.15
    "rede"         -0.15
    "pany"         -0.15
    "èħ°"          -0.15
    " ivory"       -0.15
    "æ¦"           -0.15
    " Orange"      -0.15

    Positive Logits
    Token          Logit
    " blue"         0.84
    " Blue"         0.82
    "Blue"          0.77
    "blue"          0.73
    " BLUE"         0.73
    "-blue"         0.71
    "BLUE"          0.65
    "èĵĿ"           0.60
    "_blue"         0.59
    ".blue"         0.58
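
A few tokens in the lists above (èħ°, æ¦, èĵĿ) look like byte-level BPE vocabulary strings, where each raw byte is shown as a printable stand-in character. The sketch below is one way to map such strings back to text. It assumes the tokenizer uses the GPT-2 style byte-to-character table, which this page does not confirm, and the function names are hypothetical. Under that assumption, èĵĿ corresponds to the bytes E8 93 9D, which is the UTF-8 encoding of 蓝 ("blue"), in line with the other positive logits.

```python
def gpt2_byte_decoder() -> dict:
    """Build the inverse of the GPT-2 style byte-to-character table.

    Assumption: the tokenizer behind this page uses that table.
    """
    # Bytes that are already printable stand for themselves.
    printable = (
        list(range(ord("!"), ord("~") + 1))
        + list(range(0xA1, 0xAC + 1))
        + list(range(0xAE, 0xFF + 1))
    )
    decoder = {chr(b): b for b in printable}
    # Every remaining byte is shifted to a code point starting at 256.
    shift = 0
    for b in range(256):
        if b not in printable:
            decoder[chr(256 + shift)] = b
            shift += 1
    return decoder


_DECODER = gpt2_byte_decoder()


def token_to_bytes(token: str) -> bytes:
    """Convert a raw vocabulary string to the bytes it stands for."""
    return bytes(_DECODER[ch] for ch in token)


def token_to_text(token: str) -> str:
    """Decode to text; incomplete UTF-8 sequences become U+FFFD."""
    return token_to_bytes(token).decode("utf-8", errors="replace")


if __name__ == "__main__":
    # The escapes spell the token listed above as èĵĿ.
    print(token_to_text("\u00e8\u0135\u013f"))
    # æ¦ is only two bytes of a three-byte sequence, so it is a partial character.
    print(token_to_bytes("\u00e6\u00a6").hex())
```

The helper expects raw vocabulary strings, in which a leading space appears as Ġ (U+0120). The listing above already renders that as an ordinary space, so those tokens need no decoding and would raise a KeyError if passed in as shown.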
    Act Density 0.053%

    No Known Activations