INDEX
    Explanations

    references to colors, especially combinations involving blue, red, and yellow

    elements that pertain to colors and color combinations

    New Auto-Interp
    Negative Logits
     Collider
    -0.84
     Authors
    -0.77
    issance
    -0.77
    INST
    -0.76
    SHIP
    -0.76
    Privacy
    -0.75
    utenberg
    -0.75
    essor
    -0.74
    Account
    -0.73
    Plugin
    -0.72
    POSITIVE LOGITS
     purple
    1.77
     orange
    1.77
     yellow
    1.75
     blue
    1.63
     green
    1.61
     violet
    1.61
    yellow
    1.59
     grey
    1.58
     gray
    1.56
    orange
    1.53
    Act Density 0.094%

    No Known Activations