INDEX
    Explanations

    references to the color pink

    references to the color "pink."

    New Auto-Interp
    Negative Logits
    Interstitial
    -0.72
     TODAY
    -0.69
    ilities
    -0.68
    UGH
    -0.66
    ussed
    -0.66
    ribes
    -0.65
    Downloadha
    -0.64
     condem
    -0.64
    reens
    -0.63
    olkien
    -0.63
    POSITIVE LOGITS
    erton
    1.28
     Floyd
    1.24
    bike
    1.02
    tail
    1.00
    washing
    0.94
     slime
    0.94
    heart
    0.89
    ety
    0.84
    y
    0.84
     Panther
    0.82
    Act Density 0.019%

    No Known Activations