INDEX
    Explanations

    specific color names and their variations

    Negative Logits
    Token       Logit
    "ammers"    -0.17
    "ongan"     -0.16
    "izza"      -0.15
    "igmoid"    -0.14
    " sole"     -0.14
    "onya"      -0.14
    " Sole"     -0.14
    "íĬ"        -0.14
    "uncio"     -0.14
    "968"       -0.14
    Positive Logits
    Token       Logit
    "æ¦ľ"       0.16
    "νÏĦ"       0.15
    "shint"     0.15
    "DET"       0.15
    "DEX"       0.15
    "ipar"      0.14
    "ucht"      0.14
    "ç§ĺ"       0.14
    "hook"      0.14
    "CADE"      0.14
    Activation Density: 0.266%

    No Known Activations