INDEX
    Explanations

    references to paper products or materials

    New Auto-Interp
    Negative Logits
    sb
    -0.20
    eva
    -0.20
    sin
    -0.18
    sa
    -0.18
    sf
    -0.18
    sing
    -0.17
    spb
    -0.17
    sla
    -0.17
    sy
    -0.17
    tics
    -0.17
    POSITIVE LOGITS
    clip
    0.39
    weight
    0.35
    backs
    0.34
    weights
    0.31
    trail
    0.29
     towel
    0.28
    board
    0.27
     towels
    0.26
    mill
    0.26
    work
    0.26
    Act Density 0.026%

    No Known Activations