INDEX
    Explanations

    mentions of popular food items or dishes

    New Auto-Interp
    Negative Logits
     Garland
    -0.17
    adro
    -0.15
    holm
    -0.15
     dul
    -0.15
     Kok
    -0.14
     pij
    -0.14
    rael
    -0.14
    ofilm
    -0.14
     Hubbard
    -0.14
     colum
    -0.14
    POSITIVE LOGITS
     burger
    0.43
     burgers
    0.40
     Burg
    0.40
     bun
    0.37
     Burger
    0.36
    burg
    0.35
     patt
    0.35
    burger
    0.34
     hamburg
    0.32
     hamburger
    0.31
    Act Density 0.044%

    No Known Activations