    Explanations

    punctuation

    This neuron detects numeric score or confidence tokens: floating-point (decimal) numbers that appear in the text.
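
To make the explanation above concrete, here is a minimal sketch of the kind of text it describes. The pattern `DECIMAL_RE` and the helper `find_decimals` are hypothetical names introduced for illustration; they only approximate "decimal numbers in text" with a regular expression and say nothing about how the neuron itself computes its activation.

```python
import re

# Hypothetical pattern (an assumption, not taken from the page):
# digits, a decimal point, then more digits, e.g. "0.87".
# The lookarounds skip dotted runs such as version strings like "1.2.3".
DECIMAL_RE = re.compile(r"(?<![\w.])\d+\.\d+(?![\w.])")


def find_decimals(text: str) -> list[str]:
    """Return the decimal-number substrings found in `text`."""
    return DECIMAL_RE.findall(text)
```

For example, `find_decimals("score: 0.87, confidence 12.5, version 3")` picks out `0.87` and `12.5` and ignores the bare integer.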

    Negative Logits
    Token               Logit
    python              -0.07
    承担                -0.07
    סטודנט              -0.06
    %%%%%%%%%%%%%%%%    -0.06
                        -0.06
    unch                -0.06
    募集                -0.06
                        -0.06
     nationals          -0.06
     Broadcom           -0.06
    Positive Logits
    Token               Logit
    الجزائر             0.08
    หลายคน              0.08
    ählt                0.07
                        0.07
    aldi                0.07
    stellung            0.06
    ategorized          0.06
                        0.06
    .Concat             0.06
    аль                 0.06
    Act Density 0.087%
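
As a rough reading of the density figure above, the sketch below assumes that activation density means the fraction of tokens on which the neuron has a nonzero activation. The page does not state that definition, and the function name `activation_density` is hypothetical. Under that assumption, 0.087% corresponds to about 87 active tokens per 100,000.

```python
def activation_density(activations: list[float]) -> float:
    """Fraction of tokens with a positive activation (assumed definition)."""
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > 0)
    return active / len(activations)


# Synthetic example data, not taken from the page:
# 87 active tokens out of 100,000.
example = [0.0] * 99_913 + [1.0] * 87
density_percent = activation_density(example) * 100
```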

    No Known Activations