INDEX
    Explanations

    references to the color yellow or locations with "Yellow" in the name

    New Auto-Interp
    Negative Logits
    tto
    -0.79
    cffff
    -0.77
    Downloadha
    -0.75
    76561
    -0.73
    inion
    -0.73
    ========
    -0.72
    ilities
    -0.71
    isSpecialOrderable
    -0.70
    natureconservancy
    -0.69
     mathemat
    -0.68
    POSITIVE LOGITS
    cake
    0.96
     fever
    0.94
    knife
    0.93
    stone
    0.90
    ish
    0.85
    prints
    0.82
    efined
    0.82
     ribbon
    0.80
    beard
    0.80
    heart
    0.80
    Act Density 0.012%

    No Known Activations