INDEX
    Explanations

    phrases related to food and cooking

    New Auto-Interp
    Negative Logits
     sugar
    -0.16
     маÑģло
    -0.15
     sugars
    -0.15
     Baker
    -0.15
    lobal
    -0.15
    tica
    -0.15
     ä½ı
    -0.15
     meisjes
    -0.14
    rán
    -0.14
    dess
    -0.14
    POSITIVE LOGITS
     soup
    0.53
     Soup
    0.47
    soup
    0.43
    Soup
    0.41
     broth
    0.41
     sou
    0.39
     Sou
    0.34
    湯
    0.32
    _soup
    0.31
    Sou
    0.31
    Act Density 0.081%

    No Known Activations