INDEX
    Explanations

    on your toes/feet/knees

    Negative Logits
    " thumb"      0.77
    " pullover"   0.73
    " shoe"       0.72
    "出一个"      0.69
    " Schritt"    0.69
    " chauss"     0.69
    " flap"       0.67
    " sock"       0.67
    "Sock"        0.65
    " руку"       0.65
    Positive Logits
    " knees"      0.92
    "fees"        0.79
    " feet"       0.78
    " feat"       0.77
    " Kne"        0.75
    " fours"      0.74
    " 구현"       0.73
    " wheels"     0.73
    " heels"      0.72
    " monomials"  0.72
    Act Density 0.012%

    No Known Activations