INDEX
    Explanations

    references to food, particularly ice cream and dessert items

    New Auto-Interp
    Negative Logits
     frying
    -0.17
     cooking
    -0.16
     ÅŁar
    -0.16
     bread
    -0.16
    oise
    -0.16
     Marketable
    -0.15
     cooked
    -0.15
     underwater
    -0.15
     Vine
    -0.15
    bread
    -0.15
    POSITIVE LOGITS
     ice
    0.63
     Ice
    0.54
    ice
    0.52
    Ice
    0.48
     ICE
    0.45
    åĨ°
    0.41
    ICE
    0.40
     scoop
    0.39
     icy
    0.35
     sco
    0.35
    Act Density 0.040%

    No Known Activations