INDEX
    Explanations

    programming-related keywords and syntax

    The neuron detects the start-of-sequence (beginning-of-document) token.

    New Auto-Interp
    Negative Logits
    出版年
    -1.00
     ब्रेकडाउन
    -0.86
     تضيفلها
    -0.74
    KommentareTeilen
    -0.73
    <unused8>
    -0.72
    باردا
    -0.72
    <unused43>
    -0.72
    <unused16>
    -0.72
    <pad>
    -0.72
    [@BOS@]
    -0.71
    POSITIVE LOGITS
    import
    0.52
    <strong>
    0.51
    <h1>
    0.50
    <
    0.47
    <h2>
    0.46
    The
    0.44
    I
    0.42
    ↵↵
    0.42
    [
    0.42
    0.42
    Act Density 0.000%

    No Known Activations