INDEX
    Explanations

    The neuron flags phrases where the writer refers to their own investigative process (e.g. “I did some research,” “after some trial and error,” “I discovered”).

    New Auto-Interp
    Negative Logits
    avid
    -0.07
     ConsoleColor
    -0.06
    -0.06
     Ảnh
    -0.06
    دة
    -0.06
    ored
    -0.06
     Encryption
    -0.06
     cắt
    -0.06
    ipients
    -0.06
     supportive
    -0.06
    POSITIVE LOGITS
    ráf
    0.08
    .พ
    0.07
    0.07
     Summit
    0.07
    0.07
    _bm
    0.07
     heterogeneous
    0.06
     pound
    0.06
     mainland
    0.06
    více
    0.06
    Act Density 0.022%

    No Known Activations