INDEX
    Explanations

    food-related terms such as dishes and flavors

    nouns and terms related to food, music, crime, and emotional states

    New Auto-Interp
    Negative Logits
    ucc
    -0.69
    lear
    -0.64
    ivism
    -0.63
    culture
    -0.59
    kus
    -0.59
    ivist
    -0.59
    olog
    -0.59
    士
    -0.58
    lopp
    -0.58
    psons
    -0.58
    POSITIVE LOGITS
     respectively
    0.72
     Hots
    0.68
     bordering
    0.63
     preceded
    0.62
     unheard
    0.60
     apiece
    0.60
     Leilan
    0.60
     followed
    0.58
    *.
    0.58
     flanked
    0.57
    Act Density 0.546%

    No Known Activations