    Explanations

    references to headwear and related clothing items

    Negative Logits (token, value; quotes mark leading spaces)
    "yte"         -0.15
    "arda"        -0.15
    " bufsize"    -0.14
    "MOVED"       -0.14
    "oku"         -0.13
    "inia"        -0.13
    "enia"        -0.13
    "ι"           -0.13
    " ydk"        -0.13
    "ayout"       -0.13
    Positive Logits (token, value; quotes mark leading spaces)
    " hat"        0.47
    " hats"       0.44
    " Hat"        0.38
    "hat"         0.37
    "帽"          0.37
    " Hats"       0.35
    "Hat"         0.34
    " caps"       0.32
    "_hat"        0.30
    " straw"      0.30
    Act Density 0.035%

    No Known Activations