INDEX
    Explanations

    mentions of cleaning or removing something

    New Auto-Interp
    Negative Logits
    yip
    -0.99
    abul
    -0.65
    XT
    -0.65
    akening
    -0.64
    trl
    -0.64
    ommod
    -0.64
    PsyNetMessage
    -0.63
    arsity
    -0.63
    Mand
    -0.63
    MP
    -0.62
    POSITIVE LOGITS
    liness
    1.05
     cleaned
    0.92
     ashore
    0.89
     cleaner
    0.88
     linen
    0.85
    clean
    0.85
     towels
    0.80
     cleaners
    0.80
     toilets
    0.79
     stains
    0.79
    Act Density 2.821%

    No Known Activations