INDEX
    Explanations

    books and essays

    This neuron activates on subword pieces containing accented characters (e.g. é, è, ô); that is, it appears to detect fragments of Romance-language words with diacritics.
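To make the criterion in this explanation concrete, the following is a minimal sketch of a check for "subword piece contains an accented character". It is an illustration only: the helper name `has_diacritic` is hypothetical, it is not how the explanation was generated, and it tests for any combining mark rather than for Romance languages specifically, so it also matches accented pieces from other languages.

```python
import unicodedata


def has_diacritic(token: str) -> bool:
    """Return True if any character in `token` carries a combining mark.

    Each character is decomposed with NFD normalization; a character such as
    'é' becomes 'e' followed by U+0301 (combining acute accent), which has a
    non-zero canonical combining class.
    """
    for ch in token:
        decomposed = unicodedata.normalize("NFD", ch)
        if any(unicodedata.combining(c) for c in decomposed):
            return True
    return False


# Hypothetical usage on a few subword pieces.
pieces = ["é", "caf\u00e9", " pizza", "Speech"]
flags = {p: has_diacritic(p) for p in pieces}
```

A check like this could be run over a neuron's top-activating tokens to see how many actually contain diacritics, which is one rough way to sanity-check an explanation of this kind.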

    Negative Logits
    Logit   Token
    -0.06   (token not recovered)
    -0.06   ' ANSW'
    -0.06   ' 자유'
    -0.06   ' pizza'
    -0.06   'ěř'
    -0.06   'Speech'
    -0.06   'Sw'
    -0.06   'iz'
    -0.06   ' المع'
    -0.06   ' гем'

    Positive Logits
    Logit   Token
     0.07   ' #$'
     0.07   'جه'
     0.06   ' [.'
     0.06   ' localObject'
     0.06   '.FromSeconds'
     0.06   '(optional'
     0.06   ';"><?'
     0.06   ' bourgeois'
     0.06   ']:='
     0.06   ' scrimmage'

    Activation Density: 0.041%

    No Known Activations