INDEX
    Explanations

    references to colors and color-related descriptions

    New Auto-Interp
    Negative Logits
    oppers
    -0.16
     Busty
    -0.15
    .scalablytyped
    -0.15
    UBY
    -0.14
    ekim
    -0.14
    chl
    -0.14
    azi
    -0.14
    Ïĥκ
    -0.14
     stringWith
    -0.14
    rello
    -0.14
    POSITIVE LOGITS
     Hamm
    0.17
     chosen
    0.15
    isan
    0.15
     extrav
    0.15
     Weber
    0.15
     Colbert
    0.14
     Crom
    0.14
    ieren
    0.13
    .flink
    0.13
    rib
    0.13
    Act Density 0.017%

    No Known Activations