INDEX
    Explanations

    The neuron flags tokens that are part of figure or illustration references (e.g. “Figure,” “Fig.,” section numbers, bracketed or parenthesized figure labels).

    Negative Logits
    ied             -0.07
    anda            -0.06
    esture          -0.06
    ICH             -0.06
    GLOSS           -0.06
     Mixed          -0.06
    Moves           -0.06
     Sizes          -0.06
    imated          -0.06
     bec            -0.06
    Positive Logits
     inflater       0.07
    ","+            0.07
    uito            0.07
     protože        0.06
     Rel            0.06
     بالأ           0.06
     pz             0.06
     ghetto         0.06
    PostalCodesNL   0.06
    ={{↵            0.06
    Activation Density: 0.008%

    No Known Activations