INDEX
    Explanations

    requests and demands

    The neuron activates on non-English word fragments, particularly Slavic/Slovene tokens with diacritics.

    Negative Logits
    " Lux"            -0.06
    " Ner"            -0.06
    " Applied"        -0.06
    " grinder"        -0.06
    "_rd"             -0.06
    " election"       -0.06
    " considering"    -0.06
    " applied"        -0.06
    " possibility"    -0.06
    "ita"             -0.06
    Positive Logits
    "ději"            0.07
    "/css"            0.07
    "acomment"        0.07
    "navbar"          0.06
    "taient"          0.06
    " dk"             0.06
    " ¡"              0.06
    ".sap"            0.06
    "_kategori"       0.06
    "(iv"             0.06
    Act Density 0.177%

    No Known Activations