INDEX
    Explanations

    The neuron detects terms signaling that an item is still open or awaiting action (e.g. pending or outstanding entries).

    Negative Logits
    Token           Logit
    IFn             -0.08
    พวก             -0.07
    рет             -0.07
    148             -0.06
     orbits         -0.06
     ساخته          -0.06
     Ning           -0.06
    reten           -0.06
    mayacak         -0.06
                    -0.06
    Positive Logits
    Token           Logit
     afflict        0.06
     (**            0.06
    ."↵↵↵↵          0.06
    (elem           0.06
     neck           0.06
    promo           0.06
    agenda          0.06
     adolescente    0.06
     GMC            0.06
     Neck           0.06
    Act Density 0.034%

    No Known Activations