INDEX
    Explanations

    references to colors, particularly shades of yellow

    mentions of the color yellow

    New Auto-Interp
    Negative Logits
    rative
    -0.88
    enegger
    -0.76
    itia
    -0.75
    acters
    -0.71
    ichick
    -0.70
    weeney
    -0.69
    etimes
    -0.69
    ounter
    -0.67
     las
    -0.65
    ãĤ°
    -0.65
    POSITIVE LOGITS
     Yellow
    1.02
     Fever
    1.00
     Jacket
    0.91
    knife
    0.89
     Voy
    0.84
     Jackets
    0.83
    Yellow
    0.83
     Route
    0.82
     Shirt
    0.80
     Matters
    0.80
    Act Density 0.009%

    No Known Activations