INDEX
    Explanations

    cooking instructions and recipes

    New Auto-Interp
    Negative Logits
     Drop
    -0.16
    Drop
    -0.16
    ief
    -0.15
    drop
    -0.15
     drop
    -0.15
    orate
    -0.14
    vis
    -0.14
    ساÙħ
    -0.14
    orient
    -0.14
    ../../../../
    -0.14
    POSITIVE LOGITS
    ureau
    0.17
     Noble
    0.15
     rect
    0.15
    ugas
    0.14
     Hus
    0.14
     pob
    0.14
     Huss
    0.14
     Levin
    0.14
    colo
    0.14
    -NLS
    0.14
    Act Density 0.049%

    No Known Activations