INDEX
    Explanations

    elements related to food or cooking

    New Auto-Interp
    Negative Logits
    awe
    -0.15
     Colony
    -0.15
    erve
    -0.15
    اÙĨا
    -0.14
    agues
    -0.14
    ware
    -0.13
     Observer
    -0.13
    åĬĩ
    -0.13
    hest
    -0.13
    _TRUE
    -0.13
    POSITIVE LOGITS
    ellar
    0.15
    ìĿ´ìĸ´
    0.14
    ajor
    0.14
    ãģĹãģªãģĦ
    0.14
     Learned
    0.14
     Pow
    0.14
    ptron
    0.14
    eket
    0.14
    loff
    0.14
     meisten
    0.14
    Act Density 0.371%

    No Known Activations