INDEX
    Explanations

    specific food-related or dietary references

    New Auto-Interp
    Negative Logits
     Downs
    -0.15
    yna
    -0.15
    poster
    -0.14
    uire
    -0.14
    ubit
    -0.14
    pd
    -0.14
    uler
    -0.14
    agar
    -0.14
    âij
    -0.13
    ocaly
    -0.13
    POSITIVE LOGITS
    ARIANT
    0.16
    itta
    0.15
    Overrides
    0.15
    .xhtml
    0.15
    .tif
    0.14
    atrix
    0.14
    omi
    0.14
    egade
    0.14
    ignum
    0.14
    ecast
    0.14
    Act Density 0.238%

    No Known Activations