INDEX
    Explanations

    references to specific food items or ingredients

    New Auto-Interp
    Negative Logits
    ered
    -0.15
    roma
    -0.15
     Fahr
    -0.14
    мо
    -0.14
    ÑĢом
    -0.14
    meer
    -0.14
    reds
    -0.14
     Ed
    -0.14
    ATUS
    -0.14
     matt
    -0.14
    POSITIVE LOGITS
    ome
    0.27
    ears
    0.25
    esto
    0.23
    anko
    0.23
    imiento
    0.21
    OME
    0.21
    ât
    0.21
    umper
    0.20
    ate
    0.19
    ita
    0.19
    Act Density 0.009%

    No Known Activations