INDEX
    Explanations

    elements related to craftsmanship and intricate design

    New Auto-Interp
    Negative Logits
    üst
    -0.17
    ersed
    -0.16
     genu
    -0.15
     Stap
    -0.14
    iltr
    -0.14
    ниÑĤ
    -0.14
     patched
    -0.14
     neutr
    -0.13
     decom
    -0.13
    æķ£
    -0.13
    POSITIVE LOGITS
     car
    0.35
     carve
    0.32
     carving
    0.32
    -car
    0.31
     Car
    0.31
     carved
    0.29
     et
    0.29
    car
    0.29
     ch
    0.27
    Car
    0.27
    Act Density 0.127%

    No Known Activations