    Explanations

    multiple languages

    This neuron activates on tokens containing non-ASCII characters, including accented letters, which typically occur in non-English words.
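A minimal sketch of the explanation above, written as a plain character test. It is only an illustration of what "contains non-ASCII characters" means for a token string; the helper name `has_non_ascii` is hypothetical and is not how the neuron itself computes anything. The sample tokens are copied from the logit lists on this page.

```python
def has_non_ascii(token: str) -> bool:
    """Return True if any character in the token lies outside the ASCII range (code point > 127)."""
    return any(ord(ch) > 127 for ch in token)


# Sample tokens copied from the logit lists on this page (leading spaces preserved).
tokens = [" excell", " यद", "для", " 링크", " Mehmet", " Utf"]

# Keep only the tokens that the explanation would predict as triggers.
flagged = [t for t in tokens if has_non_ascii(t)]
print(flagged)
```

Note that a name such as " Mehmet" is not flagged by this test even though it is not an English word, so a character-level check only approximates "foreign-language words".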

    Negative Logits
     excell        -0.07
     Equity        -0.07
     यद            -0.07
    kaz            -0.07
     speaks        -0.07
    -cert          -0.06
    "Our           -0.06
    18             -0.06
    LANGUAGE       -0.06
    для            -0.06
    Positive Logits
    ('{}           0.06
                   0.06
     링크          0.06
     Utf           0.06
    .em            0.06
     Mehmet        0.06
    _W             0.06
     taj           0.06
     BUFF          0.06
     whim          0.06
    Activation Density: 0.169%

    No Known Activations