INDEX
    Explanations

    The neuron detects mentions of political discourse, especially the words “politics” and “political.”

    New Auto-Interp
    Negative Logits
    каз
    -0.08
     glyph
    -0.07
    -0.07
    кат
    -0.06
     درخواست
    -0.06
    -0.06
    Sync
    -0.06
     RAF
    -0.06
     Coch
    -0.06
     Jo
    -0.06
    POSITIVE LOGITS
    _Product
    0.06
    _public
    0.06
    .Errors
    0.06
    üslü
    0.06
    ัณฑ
    0.06
    constant
    0.06
     čím
    0.06
    flate
    0.06
    forg
    0.06
    -lived
    0.06
    Act Density 0.009%

    No Known Activations