    Explanations

    text snippets

    comparative and superlative forms of adjectives and verbs.

    The neuron fires on runs of underscore characters (i.e. the blank “____” tokens used as placeholders in fill-in-the-blank questions).

    Negative Logits
    .Exception    -0.06
     proverb      -0.06
    (ad           -0.06
     ark          -0.06
    ่ท            -0.06
    .Brand        -0.06
    주소          -0.06
     Vij          -0.06
    PRETTY        -0.06
    -Year         -0.06
    Positive Logits
    izin          0.07
    asier         0.06
     çocu         0.06
     نتیجه        0.06
    ..."↵         0.06
     YY           0.06
    ну            0.06
                  0.06
    _AX           0.06
     quant        0.06
    Act Density 0.005%

    No Known Activations