    Explanations

    The neuron responds to concrete everyday nouns: common items and things (food, clothing/prints, teams, social-media terms).

    Negative Logits
    "ули"          -0.07
    " expulsion"   -0.06
    " Алекс"       -0.06
    "enses"        -0.06
    " Sparse"      -0.06
    "orrar"        -0.06
    "xB"           -0.06
    " endPoint"    -0.06
    "rypt"         -0.06
    " ethernet"    -0.06
    Positive Logits
    "Da"                 0.07
    " xác"               0.07
    " knowingly"         0.07
    ".matrix"            0.06
    " goggles"           0.06
    "yscale"             0.06
    "NECT"               0.06
    "_trial"             0.06
    (token not shown)    0.06
    " las"               0.06
    Activation Density: 0.327%

    No Known Activations