INDEX
    Explanations

    laundry-related phrases

    references to laundry and related tasks

    Negative Logits
    */(       -0.85
    alez      -0.82
    olar      -0.79
    itol      -0.72
    umar      -0.72
    ulhu      -0.72
    pps       -0.72
    ologies   -0.72
    oid       -0.72
    ioch      -0.71
    Positive Logits
     laundry  1.18
    ©¶æ¥µ     0.87
     laund    0.78
    robe      0.76
     soap     0.75
     basket   0.73
     closet   0.71
     undert   0.71
    stairs    0.70
    æ©Ł       0.69
    Act Density 0.005%

    No Known Activations