INDEX
    Explanations

    references to color or colors and their combinations in various contexts

    New Auto-Interp
    Negative Logits
    Color
    -0.28
    Colors
    -0.27
    _colors
    -0.27
     colourful
    -0.24
    Colour
    -0.24
     colours
    -0.23
     Colour
    -0.23
     Colors
    -0.23
     colour
    -0.23
    _color
    -0.23
    POSITIVE LOGITS
    -coded
    0.32
     scheme
    0.32
    ation
    0.30
     schemes
    0.30
     Scheme
    0.29
    cheme
    0.27
    fully
    0.27
    blind
    0.26
    Scheme
    0.25
    chemes
    0.25
    Act Density 0.050%

    No Known Activations