    Explanations

    terms related to food and dining experiences

    Negative Logits
    "Soup"          -0.20
    " spaghetti"    -0.19
    " Salad"        -0.18
    "酒"            -0.18
    "soup"          -0.18
    " wine"         -0.17
    " soup"         -0.17
    " beers"        -0.16
    " dinner"       -0.16
    " Wine"         -0.15
    Positive Logits
    " dough"        0.40
    " Dough"        0.37
    " pastry"       0.32
    " bakery"       0.30
    " muff"         0.28
    " baker"        0.27
    " Bakery"       0.26
    " past"         0.26
    " Baker"        0.24
    " don"          0.23
    Activation Density: 0.102%

    No Known Activations