INDEX
    Explanations

    The neuron detects tokens expressing the concept of waiting (e.g. “待ちます,” “待,” “wait”).

    New Auto-Interp
    Negative Logits
    ίνεται
    -0.07
    _FIRE
    -0.07
     POSIX
    -0.06
    -0.06
    matching
    -0.06
     frustrations
    -0.06
     проблема
    -0.06
    INDOW
    -0.06
     todas
    -0.06
     nước
    -0.06
    POSITIVE LOGITS
    Shortcut
    0.07
    0.06
    .↵↵
    0.06
    ublic
    0.06
    economic
    0.06
     nickname
    0.06
    (trim
    0.06
    Sections
    0.06
    #$
    0.06
     docks
    0.06
    Act Density 0.028%

    No Known Activations