INDEX
    Explanations

    This neuron activates on the “preced” substring in legal headings—i.e. it detects occurrences of words like “precedential” (often in “nonprecedential”).

    New Auto-Interp
    Negative Logits
    .reddit
    -0.06
    /../
    -0.06
    .show
    -0.06
    .tile
    -0.06
     tarn
    -0.06
    ketøy
    -0.06
     misogyn
    -0.06
     все
    -0.06
     Deng
    -0.06
     tile
    -0.06
    POSITIVE LOGITS
     заход
    0.07
    _sm
    0.07
     Şubat
    0.07
    roll
    0.07
    лаз
    0.06
    —he
    0.06
     तब
    0.06
    ografie
    0.06
     Perez
    0.06
    adb
    0.06
    Act Density 0.000%

    No Known Activations