INDEX
    Explanations

    mentions of various types of diets and dietary practices

    New Auto-Interp
    Negative Logits
    ion
    -0.18
    åı£
    -0.16
    fol
    -0.16
    3
    -0.15
    amas
    -0.14
    2
    -0.14
    ment
    -0.14
    most
    -0.14
    ction
    -0.14
    mand
    -0.14
    POSITIVE LOGITS
    bih
    0.15
    anan
    0.15
    첨ë¶Ģ
    0.15
    oundingBox
    0.14
     æĬķ稿æĹ¥
    0.14
    udas
    0.14
    ted
    0.14
    readcr
    0.14
    ardım
    0.14
    ìĭľìĺ¤
    0.13
    Act Density 0.017%

    No Known Activations